Preface to the Third Edition


The development of electron energy-loss spectroscopy within the last 15 years has 
been remarkable. This progress is partly due to improvements in instrumentation, 
such as the successful correction of spherical (and more recently chromatic) aberra- 
tion of electron lenses, allowing sub-Angstrom spatial resolution in TEM and STEM 
images and (in combination with Schottky and field-emission sources) much higher 
current in a focused probe. The incorporation of monochromators in commercial 
TEMs has improved the energy resolution to 0.1 eV, with further improvements 
promised. These advances have required close attention to the mechanical and elec- 
trical stability of the TEM, including thermal, vibrational, and acoustical isolation. 
The energy-loss spectrometer has been improved with a fast electrostatic shutter, 
allowing millisecond acquisition of an entire spectrum and almost simultaneous 
recording of the low-loss and core-loss regions. 

Advances in computer software have made routine such processes as spectral 
and spatial deconvolution, spectrum-imaging, and multivariate statistical analysis. 
Programs for implementing density functional and multiple-scattering calculations 
to predict spectral fine structure have become more widely available. 

Taken together, these improvements have helped to ensure that EELS can be 
applied to real materials problems as well as model systems; the technique is no 
longer mainly a playground for physicists. Another consequence is that radiation 
damage is seen to be a limiting factor in electron beam microanalysis. One response 
has been the development of TEMs that can achieve atomic resolution at lower
accelerating voltage, in an attempt to maximize the information/damage ratio. There is also
considerable interest in the use of lasers in combination with TEM-EELS, the aim 
being picosecond or femtosecond time resolution, in order to study excited states 
and perhaps even to conquer radiation damage. 

For the third edition of this textbook, I have kept the previous structure intact. 
However, the reading list and historical section of Chapter 1 have been updated.
In Chapter 2, I have retained but shortened the discussion of serial recording, 
making room for more information on monochromator designs and new electron 
detectors. In Chapter 3, I have added material on energy losses due to elastic scat- 
tering, retardation and Cerenkov effects, core excitation in anisotropic materials, 
and the delocalization of inelastic scattering. Chapter 4 now includes a discussion 
of Bayesian deconvolution, multivariate statistical analysis, and ELNES simulation.
As previously, Chapter 5 deals with practical applications of EELS in a TEM,
together with a discussion of factors that limit the spatial resolution of analysis, 
including radiation damage and examples of applications to selected materials sys- 
tems. The final section gives examples of TEM-EELS study of electronic, ceramic, 
and carbon-based materials (including graphene, carbon nanotubes, and polymers) 
and the measurement of radiation damage. 

In Appendix A, the discussion of relativistic effects is extended to include recent
theory relating to anisotropic materials and magic-angle measurements. Appendix 
B contains a brief description of over 20 freeware programs written in MATLAB. 
They include programs for first-order prism focusing, atomic-displacement cross 
sections, Richardson-Lucy deconvolution, the Kröger formula for retardation and
surface losses, and translations of the FORTRAN and BASIC codes given in the 
second edition. The table of plasmon energies in Appendix C has been extended to 
a larger number of materials and now also contains inelastic mean free paths. I have 
added an Appendix F that summarizes some of the choices involved in acquiring 
energy-loss data, with references to earlier sections of the book where these choices 
are discussed in greater detail. 

Throughout the text, I have tried to give appropriate references to topics that I 
considered outside the scope of the book or beyond my expertise. The reference list 
now contains about 1200 entries, each with an article title and page range. They 
are listed alphabetically by first author surname, but with multiauthor entries (et al. 
references in the text) arranged in chronological order. 

I am grateful to many colleagues for comment and discussion, including Les 
Allen, Phil Batson, Gianluigi Botton, Peter Crozier, Adam Hitchcock, Ferdinand 
Hofer, Archie Howie, Gerald Kothleitner, Ondrej Krivanek, Richard Leapman, 
Matt Libera, Charlie Lyman, Marek Malac, Sergio Moreno, David Muller, Steve 
Pennycook, Peter Rez, Peter Schattschneider, Guillaume Radtke, Harald Rose, John 
Spence, Mike Walls, Masashi Watanabe, and Yimei Zhu. I thank Michael Bergen for 
help with the MATLAB computer code and the National Science and Engineering 
Council of Canada for continuing financial support over the past 35 years. Most 
of all, I thank my wife Maia and my son Robin for their steadfast support and 
encouragement. 


Edmonton, AB, Canada Ray Egerton 
Chapter 1
An Introduction to EELS 


Electron energy-loss spectroscopy (EELS) involves analyzing the energy distribu- 
tion of initially monoenergetic electrons after they have interacted with a specimen. 
This interaction is sometimes confined to a few atomic layers, as when a beam of 
low-energy (100-1000 eV) electrons is “reflected” from a solid surface. Because 
high voltages are not involved, the apparatus is relatively compact but the low 
penetration depth implies the use of ultrahigh vacuum. Otherwise information is 
obtained mainly from the carbonaceous or oxide layers on the specimen’s surface. 
At these low primary energies, a monochromator can reduce the energy spread of 
the primary beam to a few millielectron volts (Ibach, 1991) and if the spectrometer 
has comparable resolution, the spectrum contains features characteristic of energy 
exchange with the vibrational modes of surface atoms, as well as valence electron 
excitation in these atoms. The technique is therefore referred to as high-resolution 
electron energy-loss spectroscopy (HREELS) and is used to study the physics and 
chemistry of surfaces and of adsorbed atoms or molecules. Although it is an impor- 
tant tool of surface science, HREELS uses concepts that are somewhat different to 
those involved in electron microscope studies and will not be discussed further in 
the present volume. The instrumentation, theory, and applications of HREELS are 
described by Ibach and Mills (1982) and by Kesmodel (2006). 

Surface sensitivity is also achieved at higher electron energies if the electrons 
arrive at a glancing angle to the surface, so that they penetrate only a shallow 
depth before being scattered out. Reflection energy-loss spectroscopy (REELS) has 
been carried out with 30-keV electrons in a molecular beam epitaxy (MBE) cham- 
ber, allowing elemental and structural analysis of the surface during crystal growth 
(Atwater et al., 1993). Glancing angle REELS is also possible in a transmission 
electron microscope (TEM) at a primary energy of 100 keV or more, as discussed 
in Section 3.3.5. 

If the electrons arrive perpendicular to a sufficiently thin specimen and their 
kinetic energy is high enough, practically all of them are transmitted without reflec- 
tion or absorption. Interaction then takes place inside the specimen, and information 
about its internal structure can be obtained by passing the transmitted beam into a 
spectrometer. For 100-keV incident energy, the specimen must be less than 1 μm
thick and preferably below 100 nm. 
Such specimens are self-supporting only over limited areas, so the incident elec-
trons must be focused into a small diameter, but in doing so we gain the advantage 
of analyzing small volumes. Electron lenses that can focus the electrons and guide 
them into an electron spectrometer are already present in a transmission electron 
microscope, so transmission EELS is usually carried out using a TEM, taking 
advantage of its imaging and diffraction capabilities to identify the structure of the 
material being analyzed. 

In this introductory chapter, we present a simplified account of the physical pro- 
cesses that occur while “fast” electrons are passing through a specimen, followed 
by an overview of energy-loss spectra and the instruments that have been developed 
to record such spectra. To identify the strengths and limitations of EELS, we con- 
clude by considering the alternative techniques that are available for analyzing the 
chemical and physical properties of a solid specimen. 


1.1 Interaction of Fast Electrons with a Solid 


When electrons enter a solid, they interact with the constituent atoms through elec- 
trostatic (Coulomb) forces. As a result of these forces, some of the electrons are 
scattered: the direction of their momentum is changed and in many cases they trans- 
fer an appreciable amount of energy to the specimen. It is convenient to divide the 
scattering into two broad categories: elastic and inelastic. 

Elastic scattering involves Coulomb interaction with atomic nuclei. Each nucleus 
represents a high concentration of charge; the electric field in its immediate vicinity 
is intense and an incident electron that approaches close enough is deflected through 
a large angle. Such high-angle deflection is referred to as Rutherford scattering, 
since its angular distribution is the same as that calculated by Rutherford for the 
scattering of alpha particles. If the deflection angle exceeds 90°, the electron is said 
to be backscattered and may emerge from the specimen at the same surface that it 
entered (Fig. 1.1a).

The majority of electrons travel further from the center of an atom, where the 
electrostatic field of the nucleus is weaker because of the inverse square law and 
the fact that the nucleus is partially shielded (screened) by atomic electrons. Most 
incident electrons are therefore scattered through small angles, typically a few 
degrees (10-100 mrad) in the case of 100-keV incident energy. In a gas or (to a 
first approximation) an amorphous solid, each atom or molecule scatters electrons 
independently. In a crystalline solid, the wave nature of the incident electrons cannot 
be ignored and interference between scattered electron waves changes the contin- 
uous distribution of scattered intensity into one that is sharply peaked at angles 
characteristic of the atomic spacing. The elastic scattering is then referred to as 
diffraction. 
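
As a rough numerical illustration of the last point, the sketch below estimates the angle at which diffracted intensity peaks for a given atomic-plane spacing. It assumes the relativistic de Broglie wavelength and the first-order Bragg condition, neither of which is spelled out in this section; the function names and the 0.2-nm spacing are illustrative choices, not values taken from the text.

```python
import math

H = 6.62607015e-34          # Planck constant (J s)
M0 = 9.1093837015e-31       # electron rest mass (kg)
C = 2.99792458e8            # speed of light (m/s)
E_CHARGE = 1.602176634e-19  # elementary charge (C)

def electron_wavelength(e0_kev):
    """Relativistic de Broglie wavelength (m) for kinetic energy e0_kev (keV)."""
    e0 = e0_kev * 1e3 * E_CHARGE      # kinetic energy in joules
    rest = M0 * C**2                  # rest energy in joules
    momentum = math.sqrt(e0 * (e0 + 2.0 * rest)) / C
    return H / momentum

def bragg_scattering_angle_mrad(e0_kev, d_nm):
    """First-order scattering angle 2*theta (mrad) for plane spacing d_nm (nm)."""
    lam = electron_wavelength(e0_kev)
    theta = math.asin(lam / (2.0 * d_nm * 1e-9))   # Bragg: lambda = 2 d sin(theta)
    return 2.0 * theta * 1e3

if __name__ == "__main__":
    lam_pm = electron_wavelength(100.0) * 1e12
    angle = bragg_scattering_angle_mrad(100.0, 0.2)   # 0.2 nm is an assumed spacing
    print(f"wavelength at 100 keV: {lam_pm:.2f} pm")
    print(f"scattering angle for d = 0.2 nm: {angle:.1f} mrad")
```

For spacings of a few tenths of a nanometer, the result falls within the 10-100 mrad range quoted for 100-keV electrons.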

Fig. 1.1 A classical (particle) view of electron scattering by a single atom (carbon). (a) Elastic
scattering is caused by Coulomb attraction by the nucleus. Inelastic scattering results from
Coulomb repulsion by (b) inner- or (c) outer-shell electrons, which are excited to a higher energy
state. The reverse transitions (de-excitation) are shown by broken arrows

Although the term elastic usually implies negligible exchange of energy, this
condition holds only when the scattering angle is small. For a head-on collision
(scattering angle = 180°), the energy transfer is given (in electron volts) by
$E_{\max} = 2148\,(E_0 + 1.002)\,E_0/A$, where $E_0$ is the incident energy in megaelectron
volts and $A$ is the atomic weight of the target nucleus. For $E_0 = 100$ keV, $E_{\max} > 1$ eV
and, in the case of a light element, may exceed the energy needed to displace the
atom from its lattice site, resulting in displacement damage within a crystalline sample,
or removal of atoms by sputtering from its surface. However, such high-angle
collisions are rare; for the majority of elastic interactions, the energy transfer is
limited to a small fraction of an electron volt, and in crystalline materials is best
described in terms of phonon excitation (vibration of the whole array of atoms).
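
The head-on energy transfer is easily evaluated. The following minimal sketch applies the expression quoted above, with the incident energy in megaelectron volts; the function name and the choice of elements are ours, and the atomic weights are rounded standard values.

```python
def max_energy_transfer_ev(e0_mev, atomic_weight):
    """Head-on (180 degree) energy transfer in eV; e0_mev is the incident energy in MeV."""
    return 2148.0 * (e0_mev + 1.002) * e0_mev / atomic_weight

if __name__ == "__main__":
    # Illustrative selection of light and heavy elements (rounded atomic weights).
    for name, a in [("C", 12.011), ("Si", 28.086), ("Cu", 63.546), ("Au", 196.97)]:
        e_max = max_energy_transfer_ev(0.1, a)   # 0.1 MeV = 100 keV
        print(f"{name}: E_max = {e_max:.1f} eV at 100 keV")
```

Because the result scales as 1/A, the light elements receive the largest energy transfer, which is why displacement and sputtering are of most concern for them.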

Inelastic scattering occurs as a result of Coulomb interaction between a fast inci- 
dent electron and the atomic electrons that surround each nucleus. Some inelastic 
processes can be understood in terms of the excitation of a single atomic electron 
into a Bohr orbit (orbital) of higher quantum number (Fig. 1.1b) or, in terms of 
energy band theory, to a higher energy level (Fig. 1.2). 

Consider first the interaction of a fast electron with an inner-shell electron, whose 
ground-state energy lies typically some hundreds or thousands of electron volts 
below the Fermi level of the solid. Unoccupied electron states exist only above 
the Fermi level, so the inner-shell electron can make an upward transition only 
if it absorbs an amount of energy similar to or greater than its original binding 
energy. Because the total energy is conserved at each collision, the fast electron 
loses an equal amount of energy and is scattered through an angle typically of 
the order of 10 mrad for 100-keV incident energy. As a result of this inner-shell 
scattering, the target atom is left in a highly excited (or ionized) state and will 
quickly lose its excess energy. In the de-excitation process, an outer-shell elec- 
tron (or an inner-shell electron of lower binding energy) undergoes a downward 
transition to the vacant “core hole” and the excess energy is liberated as electro- 
magnetic radiation (x-rays) or as kinetic energy of another atomic electron (Auger 
emission). 
Fig. 1.2 Energy-level diagram of a solid, including K- and L-shell core levels and a valence band
of delocalized states (shaded); $E_F$ is the Fermi level and $E_{vac}$ the vacuum level. The primary
processes of inner- and outer-shell excitation are shown on the left, secondary processes of photon and
electron emission on the right


Outer-shell electrons can also undergo single-electron excitation. In an insula- 
tor or semiconductor, a valence electron makes an interband transition across the 
energy gap; in the case of a metal, a conduction electron makes a transition to a 
higher state, possibly within the same energy band. If the final state of these transi- 
tions lies above the vacuum level of the solid and if the excited atomic electron has 
enough energy to reach the surface, it may be emitted as a secondary electron. As 
before, the fast electron supplies the necessary energy (generally a few electron volts 
or tens of electron volts) and is scattered through an angle of typically 1 or 2 mrad 
(for $E_0 \approx 100$ keV). In the de-excitation process, electromagnetic radiation may be
emitted in the visible region (cathodoluminescence), although in many materials the 
reverse transitions are radiationless and the energy originally deposited by the fast 
electron appears as heat. Particularly in the case of organic compounds, not all of 
the valence electrons return to their original configuration; the permanent disruption 
of chemical bonds is described as ionization damage or radiolysis. 

As an alternative to the single-electron mode of excitation, outer-shell inelastic 
scattering may involve many atoms of the solid. This collective effect is known as 
a plasma resonance (an oscillation of the valence electron density) and takes the 
form of a longitudinal traveling wave, similar in some respects to a sound wave. 
According to quantum theory, the excitation can also be described in terms of the
creation of a pseudoparticle, the plasmon, whose energy is $E_p = \hbar\omega_p$, where $\hbar$ is
the reduced Planck constant and $\omega_p$ is the plasmon frequency in radians per second, which is
proportional to the square root of the valence electron density. For the majority of
solids, $E_p$ lies in the range 5-30 eV.
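
The square-root dependence on electron density can be made concrete with the free-electron expression $\omega_p = \sqrt{n e^2/(\varepsilon_0 m_0)}$. This formula is an assumption here, since the text states only the proportionality, and real solids deviate from it to varying degrees. The sketch below applies it to aluminum, taking three valence electrons per atom and a density of 2.70 g/cm$^3$ as illustrative inputs.

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant (J s)
M0 = 9.1093837015e-31       # electron rest mass (kg)
E_CHARGE = 1.602176634e-19  # elementary charge (C)
EPS0 = 8.8541878128e-12     # vacuum permittivity (F/m)
AVOGADRO = 6.02214076e23    # atoms per mole

def valence_density(density_g_cm3, atomic_weight, valence_per_atom):
    """Valence electrons per cubic metre from mass density and atomic weight."""
    atoms_per_m3 = density_g_cm3 * 1e6 / atomic_weight * AVOGADRO
    return atoms_per_m3 * valence_per_atom

def plasmon_energy_ev(n_per_m3):
    """Free-electron plasmon energy (eV) for an electron density n_per_m3."""
    omega_p = math.sqrt(n_per_m3 * E_CHARGE**2 / (EPS0 * M0))
    return HBAR * omega_p / E_CHARGE

if __name__ == "__main__":
    n_al = valence_density(2.70, 26.98, 3)   # assumed inputs for aluminum
    print(f"Al: n = {n_al:.3e} m^-3, E_p = {plasmon_energy_ev(n_al):.1f} eV")
```

The free-electron estimate for aluminum comes out close to the 16-eV plasmon peaks described in Section 1.3.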

Plasmon excitation being a collective phenomenon, the excess energy is shared 
among many atoms when viewed over an extended time period. At a given instant, 
however, the energy is likely to be carried by only one electron (Ferrell, 1957),
which makes plausible the fact that plasmons can be excited in insulators, $E_p$ being
generally higher than the excitation energy of the valence electrons (i.e., the band 
gap). The essential requirement for plasmon excitation is that the participating elec- 
trons can communicate with each other and share their energy, a condition that is 
fulfilled for a band of delocalized states but not for the atomic-like core levels. The 
lifetime of a plasmon is very short; it decays by depositing its energy (via interband 
transitions) in the form of heat or by creating secondary electrons. 

In addition to exciting volume or “bulk” plasmons within the specimen, a fast 
electron can create surface plasmons at each exterior surface. However, these surface 
excitations dominate only in very thin (<20 nm) samples or small particles. 

Plasmon excitation and single-electron excitation represent alternative modes of 
inelastic scattering. In materials in which the valence electrons behave somewhat 
like free particles (e.g., the alkali metals), the collective form of response is predom- 
inant. In other cases (e.g., rare gas solids), plasmon effects are weak or nonexistent. 
Most materials fall between these two extremes. 


1.2 The Electron Energy-Loss Spectrum 


The secondary processes of electron and photon emission from a specimen can be 
studied in detail by appropriate spectroscopies, as discussed in Section 1.4. In elec- 
tron energy-loss spectroscopy, we deal directly with the primary process of electron 
excitation, which results in the fast electron losing a characteristic amount of energy. 
The transmitted electron beam is directed into a high-resolution electron spectrom- 
eter that separates the electrons according to their kinetic energy and produces an 
electron energy-loss spectrum showing the number of electrons (scattered intensity) 
as a function of their decrease in kinetic energy. 

A typical loss spectrum, recorded from a thin specimen over a range of about 
1000 eV, is shown in Fig. 1.3. The first zero-loss or “elastic” peak represents 
electrons that are transmitted without suffering measurable energy loss, including 
electrons scattered elastically and those that excite phonon modes, for which the 
energy loss is less than the experimental energy resolution. In addition, the zero- 
loss peak includes electrons that can be regarded as unscattered, since they lose no 
energy and remain undeflected after passing through the specimen. The correspond- 
ing electron waves undergo a phase change but this is detectable only by holography 
or high-resolution imaging. 

Fig. 1.3 Electron energy-loss spectrum of a high-temperature superconductor (YBa$_2$Cu$_3$O$_7$) with
the electron intensity on a logarithmic scale, showing zero-loss and plasmon peaks and ionization
edges arising from each element. Courtesy of D.H. Shin, Cornell University

Inelastic scattering from outer-shell electrons is visible as a peak (or a series of
peaks, in thicker specimens) in the 4-40 eV region of the spectrum. At higher energy
loss, the electron intensity decreases rapidly, making it convenient to use a logarithmic
scale for the recorded intensity, as in Fig. 1.3. Superimposed on this smoothly
decreasing intensity are features that represent inner-shell excitation; they take the
form of edges rather than peaks, the inner-shell intensity rising rapidly and then
falling more slowly with increasing energy loss. The sharp rise occurs at the ionization
threshold, whose energy-loss coordinate is approximately the binding energy
of the corresponding atomic shell. Since inner-shell binding energies depend on the
atomic number of the scattering atom, the ionization edges present in an energy-loss
spectrum reveal which elements are present within the specimen. Quantitative elemental
analysis is possible by measuring an area under the appropriate ionization
edge, making allowance for the underlying background.
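
One common way of allowing for the background is to fit a smooth function to the spectrum just below the ionization threshold and extrapolate it beneath the edge. The sketch below assumes a power-law background of the form $A E^{-r}$, which is not stated in this section, and uses a synthetic, noise-free spectrum whose edge position, amplitudes, and window limits are invented purely for illustration.

```python
import math

def fit_power_law(energies, counts):
    """Least-squares fit of counts = A * E**(-r) in log-log space; returns (A, r)."""
    xs = [math.log(e) for e in energies]
    ys = [math.log(c) for c in counts]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return math.exp(intercept), -slope

def edge_integral(energies, counts, fit_window, signal_window):
    """Background-subtracted counts summed over signal_window (limits in eV)."""
    pre = [(e, c) for e, c in zip(energies, counts)
           if fit_window[0] <= e < fit_window[1]]
    a, r = fit_power_law([e for e, _ in pre], [c for _, c in pre])
    total = 0.0
    for e, c in zip(energies, counts):
        if signal_window[0] <= e < signal_window[1]:
            total += c - a * e ** (-r)   # subtract the extrapolated background
    return total

if __name__ == "__main__":
    # Synthetic spectrum: power-law background plus a made-up edge at 284 eV.
    energies = [float(e) for e in range(200, 400)]
    edge = [50.0 * (284.0 / e) ** 4.0 if e >= 284.0 else 0.0 for e in energies]
    counts = [1.0e9 * e ** -3.0 + s for e, s in zip(energies, edge)]
    signal = edge_integral(energies, counts, (230.0, 280.0), (284.0, 334.0))
    print(f"recovered edge counts: {signal:.1f}")
    print(f"true edge counts:      {sum(edge[84:134]):.1f}")
```

With real data, noise in the pre-edge window limits how far the background can be extrapolated reliably, so the choice of fitting and integration windows is a compromise.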

When viewed in greater detail, both the valence electron (low-loss) peaks and the 
ionization edges possess a fine structure that reflects the crystallographic or energy 
band structure of the specimen. Even with an energy resolution of 2 eV, it is possible 
to distinguish between different forms of an element such as carbon, as illustrated 
in Fig. 1.4. 

If the energy-loss spectrum is recorded from a sufficiently thin region of the spec- 
imen, each spectral feature corresponds to a different excitation process. In thicker 
samples, there is a substantial probability that a transmitted electron will be inelas- 
tically scattered more than once, giving a total energy loss equal to the sum of the 
individual losses. In the case of plasmon scattering, the result is a series of peaks at 
multiples of the plasmon energy (Fig. 1.5). The plural (or multiple) scattering peaks 
have appreciable intensity if the specimen thickness approaches or exceeds the mean 
free path of the inelastic scattering process, which is typically 50-150 nm for outer- 
shell scattering at 100-keV incident energy. Electron microscope specimens are 
typically of this thickness, so plural scattering is usually significant and generally 
unwanted, since it distorts the shape of the energy-loss spectrum. Fortunately it can be
removed by various deconvolution procedures.
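
To see how quickly plural scattering grows with thickness, we can assume that successive inelastic events are independent, so that the number of events per electron follows Poisson statistics. That assumption is not made explicitly in this section. In the sketch below, the probability of exactly n events depends only on the ratio of thickness to mean free path; the 100-nm mean free path is an illustrative value within the range quoted above.

```python
import math

def plural_scattering_probability(n, thickness_nm, mfp_nm):
    """Poisson probability that an electron is inelastically scattered exactly n times."""
    ratio = thickness_nm / mfp_nm
    return ratio ** n * math.exp(-ratio) / math.factorial(n)

if __name__ == "__main__":
    mfp = 100.0  # nm; assumed value within the 50-150 nm range
    for t in (10.0, 50.0, 100.0, 200.0):
        probs = [plural_scattering_probability(n, t, mfp) for n in range(4)]
        line = ", ".join(f"P{n} = {p:.3f}" for n, p in enumerate(probs))
        print(f"t = {t:5.0f} nm: {line}")
```

On this model, the unscattered (n = 0) and singly scattered (n = 1) components are equal when the thickness equals the mean free path, and double scattering is already half as probable as single scattering.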

Fig. 1.4 Low-loss and K-ionization regions of the energy-loss spectra of three allotropes of carbon,
recorded on photographic plates and then digitized (Egerton and Whelan, 1974a, b). The
plasmon peaks occur at different energies (33 eV in diamond, 27 eV in graphite, and 25 eV in
amorphous carbon) because of the different valence electron densities. The K-edge threshold is
shifted upward by about 5 eV in diamond due to the formation of an energy gap. The broad peaks
indicated by dashed lines are caused by electrons that undergo both K-shell and plasmon scattering

Fig. 1.5 Energy-loss spectra recorded from silicon specimens of two different thicknesses. The
thin sample gives a strong zero-loss peak and a weak first-plasmon peak; the thicker sample
provides plural scattering peaks at multiples of the plasmon energy

On a classical (particle) model of scattering, the mean free path (MFP) is an
average distance between scattering events. More generally, the MFP is inversely
proportional to a scattering cross section, which is a direct (rather than inverse)
measure of the intensity of scattering from each atom (or molecule) and which can
be calculated by the use of either classical physics or quantum mechanics.
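
The inverse relationship can be written as MFP = 1/(n_a σ), where n_a is the number of atoms per unit volume and σ is the cross section per atom. The sketch below evaluates this for a hypothetical cross section; the value of 1e-4 nm² and the density of 2.0 g/cm³ are placeholders chosen only to give a mean free path of the order quoted earlier, not measured data.

```python
AVOGADRO = 6.02214076e23  # atoms per mole

def atoms_per_nm3(density_g_cm3, atomic_weight):
    """Number of atoms per cubic nanometre (1 cm^3 = 1e21 nm^3)."""
    return density_g_cm3 / atomic_weight * AVOGADRO * 1e-21

def mean_free_path_nm(cross_section_nm2, density_g_cm3, atomic_weight):
    """Mean free path (nm) = 1 / (atoms per unit volume * cross section per atom)."""
    return 1.0 / (atoms_per_nm3(density_g_cm3, atomic_weight) * cross_section_nm2)

if __name__ == "__main__":
    sigma = 1.0e-4   # nm^2 per atom; hypothetical value for illustration
    mfp = mean_free_path_nm(sigma, 2.0, 12.011)   # assumed carbon film density
    print(f"mean free path: {mfp:.0f} nm")
```

Halving the cross section doubles the mean free path, which is why weakly scattering processes such as inner-shell excitation have mean free paths much longer than those of outer-shell scattering.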

Inner-shell excitation gives rise to relatively low scattered intensity (due to the 
low cross section) and therefore has a mean free path that is long compared to 
the specimen thickness. The probability that a fast electron produces more than 
one inner-shell excitation is therefore negligible. However, an electron that has 
undergone inner-shell scattering may (with fair probability) also cause outer-shell 
excitation. This “mixed” inelastic scattering again involves an energy loss that is 
the sum of the two separate losses, and results in a broad peak above the ioniza- 
tion threshold, displaced from the threshold by approximately the plasmon energy; 
see Fig. 1.4. If necessary, this mixed-scattering intensity can be removed from the 
spectrum by deconvolution. 


1.3 The Development of Experimental Techniques 


We will consider now how techniques evolved for recording and analyzing the 
energy-loss spectrum of fast electrons, particularly in combination with electron 
microscopy. More recent instrumental developments are dealt with in greater detail 
in later chapters. 

In his doctoral thesis, published in 1929, Rudberg reported measurements of the 
kinetic energy of electrons after reflection from the surface of a metal such as copper 
or silver. The kinetic energy was determined using a magnetic field spectrometer that 
bent the electron trajectories through 180° (in a 25-mm radius) and gave a resolution
of about 1 part in 200, adequate for the low primary energies that were used
(40-900 eV). By measuring currents with a quadrant electrometer, the electron 
intensity could be plotted as a function of energy loss, as in Fig. 1.6. Rudberg 
showed that the loss spectrum was characteristic of the chemical composition of 
the sample and independent of the primary energy and the angle of incidence. In 
these experiments, oxidation of the surface was minimized by preparing the sample
in situ by evaporation onto a Mo or Ag substrate that was kept electrically heated.
Similar measurements were later carried out on a large number of elements by
Powell, Robins, and Best at the University of Western Australia. The reflection technique
has since been refined to give energy resolution of a few millielectron volts
at incident energies of a few hundred electron volts (Ibach and Mills, 1982) and has
also been implemented in a field-emission scanning electron microscope (Cowley,
1982).

Fig. 1.6 Energy-loss spectrum of 204-eV electrons reflected from the surface of an evaporated
film of copper (Rudberg, 1930). The zero-loss peak is shown on a reduced intensity scale. From
Rudberg (1930), with permission of The Royal Society

The first measurement of the energy spectrum of transmitted electrons was 
reported by Ruthemann (1941), using higher incident energy (2-10 keV), an 
improved magnetic spectrometer (bend radius = 175 mm, resolving power 1 in
2000), and photographic recording. Figure 1.7a shows Ruthemann’s energy-loss 
spectrum of a thin self-supporting film of Al; it displays a series of peaks at mul- 
tiples of 16 eV, which were later interpreted in terms of plasma oscillation (Bohm 
and Pines, 1951). In 1942, Ruthemann reported the observation of inner-shell losses 
in a thin film of collodion, a form of nitrocellulose (Fig. 1.7b). 

A first attempt to use inner-shell losses for elemental microanalysis was made by 
Hillier and Baker (1944), who constructed an instrument that could focus 25-75 keV 
electrons into a 20-nm probe and operate as either a microprobe or a shadow microscope.
Two condenser lenses were used to focus the electrons onto the specimen
and a third lens served to couple the transmitted electrons into a 180° magnetic
spectrometer (Fig. 1.8). Because of the poor vacuum and resulting hydrocarbon contamination,
the 20-nm probe caused specimens to become opaque in a few seconds.
Therefore an incident beam diameter of 200 nm was used, corresponding to the
analysis of $10^{-16}$ to $10^{-14}$ g of material. K-ionization edges were recorded from several
elements (including Si), as well as L- and M-edges of iron. Spectra of collodion
showed K-edges of carbon and oxygen; the nitrogen edge was usually absent, most
likely due to the preferential removal of nitrogen by the electron beam (Egerton,
1980f).

Fig. 1.7 (a) Energy-loss spectrum of 5.3-keV electrons transmitted through a thin foil of aluminum
(Ruthemann, 1941), exhibiting plasmon peaks at multiples of 16 eV loss. (b) Energy-loss
spectrum of 7.5-keV electrons transmitted through a thin film of collodion, showing K-ionization
edges arising from carbon, nitrogen, and oxygen. Reprinted from Ruthemann (1941), copyright
SpringerLink

Fig. 1.8 Photograph of the first electron microanalyzer (Hillier and Baker, 1944). Two magnetic
lenses focused electrons onto the specimen, which was located within the bore of the second lens.
A third lens focused the transmitted beam into a 180° spectrometer or produced a shadow image
of the specimen when the spectrometer field was turned off

In 1949, Möllenstedt published the design of an electrostatic energy analyzer
in which electrons were slowed down by passing them between two cylindrical 
electrodes connected to the electron-source voltage. This deceleration results in 
high off-axis chromatic aberration (large dispersion) and an energy resolution of 
about 1 part in 50,000, allowing high-resolution spectra to be recorded on photographic
plates. The Möllenstedt analyzer was subsequently added to conventional
electron microscopes (CTEMs) in several laboratories, for example, by Marton (at 
NBS, Washington), Boersch (Berlin), and Watanabe (Tokyo), and at the Cavendish 
Laboratory (Cambridge). It was usually attached to the bottom of the TEM column, 
allowing the microscope to retain its full range of magnification and diffraction 
facilities. Because this analyzer is nonfocusing in a direction parallel to the elec- 
trodes, a long (but narrow) entrance slit was used, enabling the spectrum to be 
recorded as a function of position in the specimen or, with a diffraction pattern
present on the TEM screen, as a function of electron scattering angle. 

An alternative use of the deceleration principle was employed by Blackstock 
et al. (1955), Klemperer and Shepherd (1963), Kincaid et al. (1978), and Ritsko 
(1981). The electron source and analyzer were at ground potential, the electrons 
being accelerated toward the specimen and decelerated afterward. This design pro- 
vides good energy resolution but is difficult to apply to an electron microscope, 
where the specimen cannot easily be raised to a high potential. Fink (1989) used 
a spectrometer system with the sample at ground potential, the electron source, 
monochromator, analyzer, and detector being at −170 keV. A retarding-field spectrometer
was also used by Raether and colleagues in Hamburg, who conducted
in-depth studies of the angular distribution and dispersion of bulk and surface 
plasmons in a wide variety of materials (Raether, 1980). 

A combination of electric and magnetic fields (Wien filter) was first used 
for transmission energy-loss measurements by Boersch et al. (1962) in Berlin. 
For a given energy resolution, the entrance slit can be much wider than for the 
Möllenstedt analyzer (Curtis and Silcox, 1971), allowing the angular distribution
of both strong and weak energy-loss peaks to be studied in detail (Silcox, 1977, 
1979). By using a second Wien filter as a pre-specimen monochromator, Boersch 
and colleagues eventually achieved an energy resolution of 4-6 meV at 25 keV 
incident energy (Geiger et al., 1970). This degree of resolution is sufficient to reveal 
phonon-loss peaks as well as vibrational modes and intraband electronic excita- 
tions; see Fig. 1.9. The Berlin apparatus was also used to study the composition 
of beam-induced contamination layers and weak energy gains (vibrational peaks at 
23, 43, and 52 meV) could sometimes be detected (Katterwe, 1972). Since there
was no strong focusing lens, the instrument lacked spatial resolution (the beam
diameter at the specimen was about 10 μm) and was not used extensively because
similar spatial resolution and better energy resolution could be obtained by infrared
spectroscopy.

Fig. 1.9 Energy-loss spectrum of a 25-nm germanium film showing phonon and vibrational
modes, as well as intraband electronic transitions (Schröder, 1972)


1.3.1 Energy-Selecting (Energy-Filtering) Electron Microscopes 


Instead of recording the energy-loss spectrum from a particular region of spec- 
imen, it is sometimes preferable to display a magnified image of the specimen 
(or its diffraction pattern) at a selected energy loss. This can be done by utilizing 
the imaging properties of a magnetic field produced between prism-shaped pole- 
pieces, as first demonstrated by Castaing and Henry (1962) at the University of 
Paris. Like the normal unfiltered TEM image, a plasmon-loss image was found 
to contain diffraction contrast due to differences in elastic scattering, but in suit- 
able specimens it also conveyed “chemical contrast” that was useful for identifying 
different crystallographic phases (Castaing, 1975). Installed in various laboratories,
the Castaing-Henry filter was also used to record spectra and images from
inner-shell energy losses (Colliex and Jouffrey, 1972; Henkelman and Ottensmeyer, 
1974a; Egerton et al., 1974). At the University of Toronto, Ottensmeyer reduced the 
aberrations of his filter by curving the prism edges, a design that was eventually 
incorporated into a TEM by the Zeiss company. 

In order to maintain a straight electron-optical column, the Castaing-Henry filter
uses an electrostatic mirror electrode at the electron gun potential, an arrange- 
ment not well suited to high-voltage microscopes. For their 1-MeV microscope at 
Toulouse, Jouffrey and colleagues adopted the purely magnetic “omega filter,” based 
on a design by Rose and Plies (1974). In Berlin, Zeitler’s group improved this sys- 
tem by correcting various aberrations, resulting in a commercial product: the Zeiss 
EM-912 energy-filtering microscope (Bihr et al., 1991). The omega filter has since 
been incorporated into TEM designs by other manufacturers. 

An alternative method of energy filtering is based on the scanning transmis- 
sion electron microscope (STEM). In 1968, Crewe and co-workers in Chicago built 
the first high-resolution STEM and later used an energy analyzer to improve their 
images of single heavy atoms on a thin substrate. At the same laboratory, Isaacson, 
Johnson, and Lin recorded fine structure present in the energy-loss spectra of amino 
acids and nucleic acid bases and used fading of this structure as a means of assessing 
electron irradiation damage to these biologically important compounds. 

In 1974, the Vacuum Generators Company marketed a field-emission STEM 
(the HB5), on the prototype of which Burge and colleagues at London University
installed an electrostatic energy analyzer (Browne, 1979). Ferrier’s group at 
Glasgow University investigated the practical advantages and disadvantages of 
adding post-specimen lenses to the STEM, including their effect on spectrometer 
performance. Isaacson designed an improved magnetic spectrometer for the HB5,
while Batson showed that superior energy resolution and stability could be obtained 
by using a retarding-field Wien filter. Various groups (for example, Colliex and
co-workers in Paris, Brown and Howie in Cambridge, Spence and colleagues in
Arizona) used the HB5 and its descendants to explore the high-resolution possi- 
bilities of EELS. Leapman and co-workers at NIH (Washington) employed STEM 
energy-selected imaging to record elemental maps of biological specimens at 10-nm 
resolution and showed that energy-loss spectra can reveal concentrations (of lan- 
thanides and transition metals) down to 10 ppm within 50-nm diameter areas of 
an inorganic test specimen. Using a STEM with a high-excitation objective lens, 
Pennycook and colleagues demonstrated that atomic columns in a suitably ori- 
ented crystalline specimen can be imaged using high-angle elastic scattering, the 
low-angle energy-loss signal being simultaneously available to extract chemical 
information at atomic resolution. Correction of the objective lens spherical aberration,
as achieved in the Nion STEM instrument, makes atom column imaging and
spectroscopy easier and has led to the identification and imaging of single atoms 
(Krivanek et al., 2010). 


1.3.2 Spectrometers as Attachments to a TEM


Starting in the mid-1970s, EELS attracted the attention of electron microscopists as 
a method of light element microanalysis. For this purpose a single-prism magnetic 
spectrometer mounted beneath a conventional TEM is sufficient; Marton (1946) 
appears to have been the first to assemble such a system. At the Cavendish labora- 
tory, Wittry (1969) employed the electron optical arrangement that is now widely 
used: the prism operates with the projector lens crossover as its object point and a 
spectrometer-entrance aperture (just below the TEM screen) selects the region of 
specimen (or its diffraction pattern) being analyzed (Fig. 1.10). Serial recording of 
the spectrum was achieved by scanning it across an energy-selecting slit preced- 
ing a single-channel electron detector (scintillator and photomultiplier tube). Based 
on these principles, Krivanek constructed a magnetic spectrometer that was small 
enough to fit below any conventional TEM, this design being marketed by the Gatan 
company as their Model 607 serial recording EELS system. Joy and Maher (at Bell 
Laboratories) and Egerton (in Alberta) developed data analysis systems and soft- 
ware for electronic storage of energy-loss spectra and background subtraction at 
core edges, as required for quantitative elemental analysis. 

The energy-loss spectrum can be recorded simultaneously (rather than sequentially)
by means of a position-sensitive detector, such as a photodiode or charge-coupled
device (CCD) array. Following development work in several university laboratories,
Gatan introduced in 1986 their Model 666 spectrometer, using quadrupole
lenses to project the spectrum onto a YAG transmission screen and photodiode 
array. Parallel-recording spectrometers greatly reduce the time needed to record 
inner-shell losses, resulting in less drift and electron irradiation of the specimen. 
The Gatan Enfina spectrometer was a refinement of this design, with less light 
spreading in the scintillator and the photodiode array replaced by a rectangular 
(100 × 1340-element) CCD detector.

Fig. 1.10 TEM energy analysis system with a magnetic prism spectrometer (45° deflection
angle) using the projector lens crossover as its object point (Wittry, 1969,
http://iopscience.iop.org/

The Gatan imaging filter (GIF) used both quadrupole and sextupole lenses to
correct spectrometer aberrations, with a two-dimensional CCD array as the detec- 
tor. Installed beneath a conventional TEM, it performed similar functions to an 
in-column (e.g., omega) filter. The ability to produce an elemental map (by select- 
ing an appropriate ionization edge and correcting for the pre-edge background) 
increases the analytical power of EELS, allowing elemental segregation to be 
imaged in a semiquantitative manner. The original GIF used a 1k × 1k detector,
increased to 2k × 2k in its successor, the GIF Tridiem. In 2009, Gatan introduced
the GIF Quantum incorporating fifth-order aberration correction (allowing a 9-mm
entrance aperture), faster CCD readout, and a 1-μs electrostatic shutter (developed
at Glasgow University) to allow near-simultaneous recording of the low-loss and 
core-loss regions of a spectrum. 

More complete information about a TEM specimen is contained in its spectrum 
image, obtainable using a STEM with a parallel-recording detector to record an 
entire energy-loss spectrum at each pixel or as a sequence of energy-selected images 
recorded using a two-dimensional detector in an energy-filtering TEM. The Gatan
company developed software for spectrum image processing, making this technique 
more powerful and easy to use. 

For many years, the energy resolution of most TEM-EELS systems remained 
around 1-2 eV, being limited by the energy width of the thermionic (tungsten filament
or LaB6) electron source. In the late 1990s, Schottky emission sources became
widely available, offering resolution down to 0.5 eV, and were followed by the com- 
mercial development of gun monochromators, starting with a Wien filter design 
from the FEI company (Tiemeijer et al., 2001). Following improvements to the sta- 
bility of TEM high-voltage supplies and spectrometer power supplies, an energy 
resolution down to 0.1 eV is now possible for a TEM located in a low-noise 
environment. 


1.4 Alternative Analytical Methods 


Electron energy-loss spectroscopy is only one of the many techniques available 
for determining the structure and/or chemical composition of a solid. Some of the 
techniques that are capable of high spatial resolution are listed in Table 1.1. As 
pointed out by Wittry (1980), each method employs a well-known physical principle 
but attains usefulness as a microanalytical tool only when suitable instrumentation 
becomes available. 


Table 1.1 Imaging and analysis techniques employing electron, ion, and photon beams, with 
estimates of the achievable spatial resolution 


Incident   Detected
beam       signal     Examples                                       Resolution (nm)

Electron   Electron   Electron microscopy (TEM, STEM)                0.1
                      Electron diffraction (SAED, CBED)              10-1000
                      Electron energy-loss spectroscopy (EELS)       <1
                      Auger electron spectroscopy (AES)              ~2
           Photon     X-ray emission spectroscopy (XES)              2-10
                      Cathodoluminescence (CL)
Ion        Ion        Rutherford backscattering spectroscopy (RBS)   1000
                      Secondary ion mass spectrometry (SIMS)         50
                      Local electrode atom probe (LEAP)              0.1
           Photon     Proton-induced x-ray emission (PIXE)           500
Photon     Photon     X-ray diffraction (XRD)                        30
                      X-ray absorption spectroscopy (XAS)            20
                      X-ray fluorescence spectroscopy (XRF)
           Electron   X-ray photoelectron spectroscopy (XPS)         5-10
                      Ultraviolet photoelectron spectroscopy (UPS)   1000
                      Photoelectron microscopy (PEM or PEEM)         0.5
           Ion        Laser microprobe mass analysis (LAMMA)         1000

Some of these analytical techniques, such as Auger spectroscopy, are surface-sensitive:
they characterize the first monolayer (or few monolayers) of atoms.
Others, such as EELS or electron diffraction using many-keV electrons, probe 
deeper into the bulk or (in the case of a thin specimen) provide information inte- 
grated over specimen thickness. Which category of technique is preferable depends 
on the kind of information required. 

Analysis techniques might also be classified as destructive or nondestructive. 
Secondary ion mass spectrometry (SIMS) and atom probe tomography are exam- 
ples of techniques that are necessarily destructive (non-repeatable), a property that 
may be a disadvantage but which can be utilized to give three-dimensional infor- 
mation by “depth profiling.” Electron beam methods can also be destructive, since 
inelastic scattering of the incident electrons can result in radiation damage. The 
extent of this damage depends on the electron dose needed to give a useful sig- 
nal. Elemental analysis by EELS, Auger, or x-ray emission spectroscopy relies on 
inner-shell scattering, which is comparatively weak, so radiation damage can be a 
serious problem. Transmission electron microscopy and electron diffraction utilize 
elastic scattering, which is relatively strong. However, these latter techniques are 
often used to determine structure down to the atomic level, so damage is sometimes 
still a problem. 

The different techniques can also be grouped according to the type of incident 
and detected particle, as in Table 1.1. We now outline some of the important char- 
acteristics of each technique, leaving electron energy-loss spectroscopy until the 
end of the discussion. Not included is scanning tunneling spectroscopy, which can 
attain atomic resolution when performed with a very sharp tip (in combination 
with scanning tunneling microscopy). STS gives information about the density of 
valence states and can achieve a degree of chemical specificity when combined with 
vibrational-mode spectroscopy (Zandvliet and Van Houselt, 2009). 


1.4.1 Ion Beam Methods 


In secondary ion mass spectrometry (SIMS), a specimen is bombarded with 
1-20 keV ions, causing surface atoms to be sputtered away, some as secondary 
ions whose atomic number is determined by passing them through a mass 
spectrometer. Since the surface is steadily eroded, elemental concentrations are 
obtained as a function of depth. A spatial resolution of 1 μm is routine and
50-100 nm is possible with a liquid-metal source (Levi-Setti, 1983). All elements
are detectable, including hydrogen, which has been measured in silicon at con- 
centrations below 1% (Magee, 1984). Imaging of the secondary ions is possible, 
at sub-micrometer spatial resolution (Grivet and Septier, 1978). Quantification is 
complicated by the fact that sputtering gives rise mainly to neutral atoms: the 
yield of ions is low and highly dependent on the chemical composition of the 
matrix. To avoid these difficulties, the sputtered atoms can be ionized (by an elec- 
tron gun, laser, or radio-frequency cavity), the technique then being known as 
SNMS. 

If the incident beam consists of high-energy light ions (e.g., 1-MeV protons or He
ions), the sputtering rate is low but some of the ions are elastically scattered in the 
backward direction. The energy-loss spectrum of these reflected primary ions con- 
tains peaks or edges whose energy is characteristic of each backscattering element 
and whose width gives an indication of the depth distribution, since ions that are 
backscattered deep in the sample lose energy on their way out. Rutherford backscat- 
tering spectroscopy (RBS) therefore offers a nondestructive method of performing 
three-dimensional elemental analysis. However, the lateral spatial resolution is 
limited by the current density available in the incident beam. 
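
The link between peak energy and backscattering element follows from the kinematics of an elastic two-body collision. The sketch below is an illustration rather than something taken from the text: it assumes the standard kinematic factor K (ion energy after the collision divided by the energy before), and the function name, the 170° detection angle, and the choice of He ions on Si and Au are hypothetical.

```python
import math

def kinematic_factor(m_ion, m_target, angle_deg):
    """Ratio E_after / E_before for an ion scattered elastically through
    angle_deg (laboratory frame) by a target atom initially at rest.
    Valid for m_target >= m_ion."""
    theta = math.radians(angle_deg)
    root = math.sqrt(m_target**2 - (m_ion * math.sin(theta))**2)
    return ((root + m_ion * math.cos(theta)) / (m_ion + m_target))**2

# Assumed illustrative case: 1-MeV He ions detected at 170 degrees.
E0_KEV = 1000.0
M_HE = 4.0026  # atomic mass of helium-4 (u)
for element, mass in (("Si", 28.086), ("Au", 196.97)):
    k = kinematic_factor(M_HE, mass, 170.0)
    print(f"{element}: K = {k:.3f}, surface peak near {k * E0_KEV:.0f} keV")
```

Heavier target atoms return a larger fraction of the incident energy (K is roughly 0.57 for Si and 0.92 for Au in this case), which is why the peak position identifies the element. Ions scattered below the surface lose additional energy on the way in and out, which is not modeled here.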

Very low elemental concentrations (0.1-10 ppm) can be determined from proton-induced
x-ray emission (PIXE). Most of the incident protons (typically of energy
1-5 MeV) are deflected only slightly by the nuclear field, so the bremsstrahlung 
background to the characteristic x-ray peaks is lower than when using incident 
electrons. Spatial resolution of 1 μm has been demonstrated (Johansson, 1984).

Field-ion microscopy (FIM) relies on the field ionization of an image gas, typ- 
ically neon or helium, above the apex of a sharp needle-shaped specimen. These 
specimens are often electropolished from small bars cut from the bulk material. 
Raising the field allows atoms to be field evaporated and elementally identified in 
a time-of-flight mass spectrometer, the instrument then being known as an atom 
probe (Miller, 2000). By accumulating data from many millions of field-evaporated 
ions, one can obtain the three-dimensional distribution of each element at the atomic 
scale. Although unsurpassed in terms of three-dimensional spatial resolution, atom 
probe tomography (APT) has the disadvantage of examining only a small volume 
(a truncated cone typically 100-1000 nm long and up to 200 nm in diameter) of the 
specimen. However, focused ion beam (FIB) machines now make it possible to pre- 
pare needle-shaped specimens from chosen regions of a bulk sample (Miller et al., 
2007). 

Stender et al. (2009) have compared atom probe tomography with energy- 
filtering TEM for analysis of Fe/Cr multilayers. For a compendium of detailed 
information about ion beam techniques, see Wang and Nastasi (2010). 


1.4.2 Incident Photons 


X-ray diffraction is a convenient technique for determining the symmetry of crys- 
tals and measuring lattice parameters to high accuracy. With laboratory x-ray 
sources, a relatively large volume of specimen (many cubic micrometers) is nec- 
essary to record diffraction spots that stand out above the instrumental background. 
Synchrotron sources offer much higher intensity and improved reciprocal space res- 
olution, since beam divergence can be made much smaller than for a laboratory 
source. With a coherent beam, spatial resolution can also be obtained through lens- 
less diffractive imaging, employing iterative Fourier techniques (Spence, 2005), and 
imaging of viruses and proteins (at 0.8-nm resolution) has been achieved prior to 
radiation damage by means of “diffract and destroy” techniques (Chapman, 2009). 
Without such fast-pulse techniques, radiation damage to organic specimens by 
x-rays has been estimated to be a factor of 10³ to 10⁴ larger than by electrons,
for the same diffraction information (Henderson, 1995). This factor arises from the 
relatively weak elastic scattering of x-rays and the much greater energy (almost the 
full photon energy) deposited by a photon during photoelectron excitation, compared
to about 40 eV per inelastic excitation (≈120 eV per elastic diffraction event)
for electrons. 

In x-ray absorption spectroscopy (XAS), the intensity of a transmitted beam of 
x-rays is measured as a function of incident wavelength (i.e., photon energy). To
obtain sufficient intensity and wavelength tunability, a high-brightness synchrotron 
source is preferred. X-ray absorption edges occur at incident energies close to the 
binding energy of each atomic shell and are a close analog to the ionization edges 
seen in electron energy-loss spectra. Extended absorption fine structure (EXAFS) 
occurs up to several hundred electron volts beyond the absorption edge, and can 
be used to determine interatomic distances and other properties around a specific 
atomic site, for both crystalline and amorphous specimens. X-ray absorption near-edge
structure (XANES or NEXAFS), covering a range of −15 to +50 eV relative to
the binding energy, provides details of the electronic structure at a given atomic site, 
and can be used for many different analytical purposes. 
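
Because XAS data may be tabulated against wavelength while edge energies are quoted in electron volts, a small conversion helper is often convenient. The following is a minimal sketch, not part of the original text: it relies only on the relation E = hc/λ, and the function names and the example edge energy of 284 eV (roughly the carbon K-edge) are assumptions chosen for illustration.

```python
HC_EV_NM = 1239.84198  # Planck constant times speed of light, in eV nm

def photon_energy_ev(wavelength_nm):
    """Photon energy (eV) for a given wavelength (nm)."""
    return HC_EV_NM / wavelength_nm

def wavelength_nm(energy_ev):
    """Wavelength (nm) for a given photon energy (eV)."""
    return HC_EV_NM / energy_ev

# Assumed example: an absorption edge near 284 eV and the XANES window
# of -15 to +50 eV around it, expressed as wavelengths.
edge_ev = 284.0
print(f"edge: {edge_ev:.0f} eV = {wavelength_nm(edge_ev):.2f} nm")
for offset_ev in (-15.0, 50.0):
    energy = edge_ev + offset_ev
    print(f"{offset_ev:+.0f} eV: {energy:.0f} eV = {wavelength_nm(energy):.3f} nm")
```

The XANES window spans less than 1 nm in wavelength at this energy, which illustrates why a finely tunable source is needed.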

Soft x-rays from a synchrotron can be focused by means of zone plates to yield 
an x-ray microscope with a resolution down to about 20 nm. Contrast due to differ- 
ences in absorption coefficient can be used to map individual elements or even (via 
XANES fine structure) different chemical environments of the same element (Ade 
et al., 1992). Although spatial resolution is inferior to that of the electron micro- 
scope, radiation damage may be less and the specimen need not be in vacuum. In 
the case of thicker specimens, three-dimensional information is obtainable by the 
use of tomographic techniques. 

Photoelectron spectroscopy is carried out with incident x-rays (XPS) or ultravi- 
olet radiation (UPS). In the former case, electrons are released from inner atomic 
shells and enter an electron spectrometer that produces a spectrum containing peaks 
at an energy directly related to the inner-shell binding energy of each element 
present in the sample (Watts and Wolstenholme, 2005). Besides providing elemen- 
tal analysis, the XPS peaks have chemical shifts that can reveal the oxidation state 
of each element. In UPS, valence electrons are excited by the incident radiation, 
and the electron spectrum is characteristic of the valence band states. Photoelectron 
microscopy (PEEM) is possible by immersing the sample in a strong magnetic or 
electrostatic field and imaging the photoelectrons, with the possibility of energy dis- 
crimination (Beamson et al., 1981); a resolution of 5.4 nm has been demonstrated 
(Könenkamp et al., 2010).

In laser microprobe mass analysis (LAMMA), light is focused into a small- 
diameter probe (>500 nm diameter) to volatilize a small region of a sample, 
releasing ions that are analyzed in a mass spectrometer. All elements can be detected 
and measured with a sensitivity of the order of 1 ppm. The analyzed volume is typically
0.1 μm³, so a sensitivity of 10⁻¹⁹ g is feasible (Schmidt et al., 1980). As
in the case of SIMS, quantification is complicated by the fact that the ionization 
probability is matrix dependent. If necessary, different isotopes can be distinguished. 
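
The mass sensitivity quoted above follows from simple arithmetic: the mass of material in the analyzed volume multiplied by the detectable mass fraction. The sketch below reproduces that estimate, taking the analyzed volume as 0.1 cubic micrometers and the sensitivity as 1 ppm by mass; the specimen density of 1 g/cm³ and the function name are assumptions introduced here, not values from the text.

```python
UM3_TO_CM3 = 1e-12  # one cubic micrometer expressed in cubic centimeters

def detectable_mass_g(volume_um3, density_g_cm3, mass_fraction):
    """Mass (g) of a trace element present at the given mass fraction
    within the analyzed volume."""
    return volume_um3 * UM3_TO_CM3 * density_g_cm3 * mass_fraction

# Assumed density of 1 g/cm^3 (typical of organic material).
mass = detectable_mass_g(volume_um3=0.1, density_g_cm3=1.0, mass_fraction=1e-6)
print(f"{mass:.1e} g")  # prints 1.0e-19 g
```

A denser matrix raises the result in proportion, so the estimate should be read as an order of magnitude only.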


1.4.3 Electron Beam Techniques


Transmission electron microscopy (TEM) is capable of atomic resolution, using 
either a conventional (CTEM) or a scanning transmission (STEM) instrument. 
In the case of crystalline specimens, CTEM “chemical lattice images” (phase- 
contrast images obtained under particular conditions of specimen thickness and 
defocus) allow columns of different elements to be distinguished (Ourmazd et al.,
1990). However, a high-resolution STEM fitted with a high-angle annular dark-field 
(HAADF) detector now enables atomic columns to be directly imaged, atomic number
contrast arising from the fact that the high-angle scattering is almost proportional
to Z².

Transmission electron diffraction also offers good spatial resolution. Selected 
area electron diffraction (SAED) is limited by spherical aberration of the objective 
lens (in an uncorrected instrument) but for convergent beam diffraction (CBED) the 
analyzed region is defined by the diameter of the incident beam: down to about 1 nm
in the case of a field-emission gun, but broadened because of beam spreading in the 
specimen unless the latter is very thin. Besides giving information about crystal 
symmetry, CBED has been used to measure small (0.1%) changes in lattice parameter
arising from compositional gradients. Some metal alloys contain precipitates
whose CBED pattern is sufficiently characteristic to enable their chemical composition
(and in many cases their point or space group symmetry) to be identified through a
fingerprinting procedure (Steeds, 1984), nicely complementing the EELS technique
(Williams and Carter, 2009).

In general, TEM imaging and diffraction provide structural information that is 
complementary to the structural and chemical information provided by EELS. All 
three techniques can be combined in a single instrument without any sacrifice of 
performance. In addition, the use of an electron spectrometer to produce energy- 
filtered images and diffraction patterns can greatly increase the information available 
(Auchterlonie et al., 1989; Spence and Zuo, 1992; Midgley et al., 1995). 

Auger electron spectroscopy (AES) can be carried out with incident x-rays or 
charged particles; however, the highest spatial resolution is obtained by using an 
electron beam, which also permits scanning Auger microscopy (SAM) on suitable 
specimens. Auger peak energies are characteristic of each element present in the
sample, but AES is particularly sensitive to low-Z elements, which have high Auger
yields. Quantification is more complicated than for EELS and may need to rely 
on the use of standards (Seah, 1983). The detected Auger electrons have energies 
in the range 20-500 eV, where the escape depth is of the order 1 nm. The tech- 
nique is therefore highly surface sensitive and requires ultrahigh vacuum. Bulk 
specimens can be used, in which case backscattering limits the spatial resolution 
to about 100 nm. Use of a thin specimen and improved electron optics allows a 
spatial resolution of around 2 nm (Hembree and Venables, 1992) and the identifica- 
tion of single atoms in a sufficiently radiation-resistant specimen might be possible 
(Cazaux, 1983). In practice, a resolution of 50 nm is typical for a 35-keV, 1-nA incident
beam (Rivière, 1982); the usable resolution is generally limited by statistical
noise present in the signal, a situation that is improved by use of a parallel-recording 
electron detector. Even then, the measured signal will be less than for EELS because
only a fraction of the excited Auger electrons (those generated within an escape 
depth of the surface) are detected. 

X-ray emission spectroscopy (XES) can be performed on bulk specimens, for 
example, using an electron-probe microanalyzer (EPMA) fitted with a wavelength- 
dispersive (WDX) spectrometer. As a method of elemental analysis, the EPMA 
technique has been refined to give good accuracy, about 1-2% of the amount present
with appropriate standards and corrections for atomic number, absorption, and fluorescence
effects. The accuracy becomes 5-10% for biological specimens (Goldstein
et al., 2003). A mass fraction detection limit of 1000 ppm (0.1 wt%) is routine for
most elements, and 1 ppm is possible for certain materials and operating conditions
(Robinson and Graham, 1992), which corresponds to a detection limit of 10⁻¹⁷ g in
an analyzed volume of a few cubic micrometers. The WDX spectrometer detects all 
elements except H and He and has an energy resolution ~10 eV. Alternatively, bulk 
specimens can be analyzed in a scanning electron microscope fitted with an energy- 
dispersive x-ray (EDX) spectrometer, offering shorter recording times but poorer 
energy resolution (~130 eV), which sometimes causes problems when analyzing 
overlapping peaks. 

Higher spatial resolution is available by using a thin specimen and a TEM fit- 
ted with an EDX detector, particularly in STEM mode with an electron probe of 
diameter below 10 nm. Characteristic x-rays are emitted isotropically from the spec- 
imen, resulting in a geometrical collection efficiency of typically 1% (for 0.13-sr 
collection solid angle). For a tungsten filament electron source, the detection limit 
for a medium-Z element in a 100-nm specimen was estimated to be about 10⁻¹⁹ g
(Shuman et al., 1976; Joy and Maher, 1977). Estimates of the minimum detectable 
concentration lie around 10 mmol/kg (0.04% by weight) for potassium in biological 
tissue (Shuman et al., 1976). For materials science specimens, the detection limits 
tend to be lower: metal catalyst particles of mass below 10⁻²⁰ g have been analyzed
using a field-emission source (Lyman et al., 1995), although radiation damage is a 
potential problem (Dexpert et al., 1982). For medium-Z elements in a 100-nm-thick 
Si matrix, mass fraction detection limits are in the range 0.05-3% (Joy and Maher,
1977; Williams, 1987). With a windowless or ultrathin window (UTW) detector, 
elements down to boron can be detected, although the limited energy resolution of 
the EDX detector can lead to peak-overlap problems at low photon energies; see 
Fig. 1.11a. Quantitative analysis is usually carried out using the ratio methods of
Cliff and Lorimer (1975) or of Watanabe and Williams (2006) for thin inorganic 
specimens or of Hall (1979) in the case of biological specimens. For the anal- 
ysis of light elements, extensive absorption corrections are necessary (Chan and 
Williams, 1985). 
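
Biological detection limits are often quoted in mmol/kg, whereas materials science results are given as weight percent. The conversion used above for potassium can be checked with a few lines of code. This is a minimal sketch: the function name is hypothetical, and the molar mass of potassium (39.10 g/mol) is a standard value rather than a figure from the text.

```python
POTASSIUM_MOLAR_MASS = 39.10  # g/mol, standard atomic weight of K

def mmol_per_kg_to_weight_percent(conc_mmol_per_kg, molar_mass_g_per_mol):
    """Convert a concentration in mmol per kg of specimen to weight percent."""
    grams_per_kg = conc_mmol_per_kg * 1e-3 * molar_mass_g_per_mol
    return grams_per_kg / 1000.0 * 100.0

wt = mmol_per_kg_to_weight_percent(10.0, POTASSIUM_MOLAR_MASS)
print(f"{wt:.3f} wt%")  # prints 0.039 wt%, i.e. about 0.04% by weight
```

The same function applies to other elements once the appropriate molar mass is supplied.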

The ultimate spatial resolution of thin-film x-ray analysis is limited by elastic 
scattering, which causes a broadening of the transmitted beam; see Fig. 1.12. For 
a 100-nm-thick specimen and 100-keV incident electrons, this broadening is about 
4 nm in carbon and increases with atomic number to 60 nm for a gold film of the 
same thickness (Goldstein et al., 1977). Inelastic scattering also degrades the spatial 
resolution, since it results in the production of fast secondary electrons that generate 
characteristic x-rays, particularly from light elements. For energy losses between
1 and 10 keV, the fast secondaries are emitted almost perpendicular to the incident
beam direction and have a range of the order of 10-100 nm (Joy et al., 1982). An
x-ray signal is also generated at some distance from the incident beam by backscattered
electrons, by secondary fluorescence (Bentley et al., 1984), and by any stray
electrons or ions within the TEM column.

Fig. 1.11 (a) X-ray emission spectrum recorded from an oxidized region of stainless
steel, showing overlap of the oxygen K-peak with the L-peaks of chromium and iron.
(b) Ionization edges are more clearly resolved in the energy-loss spectrum, as a
result of the better energy resolution of the electron spectrometer (Zaluzec et al.,
1984). From Zaluzec et al. (1984), copyright San Francisco Press, Inc., with permission

Fig. 1.12 Spreading of an electron beam within a thin specimen. X-rays are emitted
from the dotted region, whereas the energy-loss spectrum is recorded from the hatched
region, the spectrometer entrance aperture having a collimating effect. Auger electrons
are emitted within a small depth adjacent to each surface
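
The dependence of elastic beam broadening on atomic number and thickness can be explored with a simple single-scattering estimate. The sketch below is an assumption on my part, not a formula stated in the text: it uses the commonly quoted expression b = 6.25 × 10⁵ (Z/E0)(ρ/A)^(1/2) t^(3/2), with b and t in cm, E0 in eV, and ρ in g/cm³. The densities (2.0 g/cm³ for carbon, 19.3 g/cm³ for gold) and the function name are also assumed.

```python
import math

def beam_broadening_nm(z, atomic_weight, density_g_cm3, thickness_nm, e0_kev):
    """Single-scattering estimate of beam broadening at the exit surface.
    Assumed formula (not from the text): b = 6.25e5 (Z/E0) sqrt(rho/A) t^1.5,
    with b, t in cm and E0 in eV."""
    t_cm = thickness_nm * 1e-7
    e0_ev = e0_kev * 1e3
    b_cm = 6.25e5 * (z / e0_ev) * math.sqrt(density_g_cm3 / atomic_weight) * t_cm**1.5
    return b_cm * 1e7  # convert cm to nm

# 100-nm films at 100 keV; densities are assumed values.
for name, z, a, rho in (("C", 6, 12.011, 2.0), ("Au", 79, 196.97, 19.3)):
    b = beam_broadening_nm(z, a, rho, thickness_nm=100.0, e0_kev=100.0)
    print(f"{name}: broadening of roughly {b:.0f} nm")
```

Under these assumptions the estimate gives a few nanometers for carbon and several tens of nanometers for gold, the same order as the values cited above, although it is not expected to reproduce them exactly. The t^(3/2) scaling shows why thinner specimens give markedly better x-ray spatial resolution.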


1.5 Comparison of EELS and EDX Spectroscopy 


Originally, EDX detectors were protected from water vapor and hydrocarbons in the 
microscope vacuum by a 10-μm-thick beryllium window, which strongly absorbed
photons of energy less than 1000 eV and precluded analysis of elements of atomic 
number less than 11. The subsequent deployment of ultrathin (UTW) and atmo- 
spheric pressure (ATW) windows allowed elements down to boron to be detected 
routinely. Recently, windowless in-column silicon drift detectors have been devel- 
oped (Schlossmacher et al., 2010) and offer a total solid angle as high as 0.9 sr, 
allowing 7% of the emitted x-rays to be analyzed. Wavelength-dispersive spectrom- 
eters with parallel-recording detectors are also available for the TEM and can detect 
elements down to Li (Terauchi et al., 2010a) with an energy resolution of typically 
1 eV but relatively low solid angle (Terauchi et al., 2010b). These developments 
make EDX spectroscopy competitive with EELS for the detection of light elements 
in a TEM specimen. Table 1.2 lists some of the factors relevant to a comparison of 
these two techniques. 


1.5.1 Detection Limits and Spatial Resolution 


For the same incident beam current, the count rate of characteristic x-rays is less 
than that of core-loss electrons (detectable by EELS) for two reasons. First, while 
the K-line fluorescence yield is above 50% for Z > 32, it falls below 2% for Z < 11 
(see Fig. 1.13), reducing the generation rate for light elements. Second, characteristic
x-rays are emitted isotropically and the fraction recorded by an EDX detector
is below 10% (only 1% for a 0.13-sr detector), whereas the energy-loss electrons 
are concentrated into a limited angular range, allowing a spectrometer collection 
efficiency of typically 20-50%. 
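
The two factors just described can be combined into a rough figure of merit. The sketch below is illustrative only: it treats the detected x-ray fraction as the fluorescence yield multiplied by the geometric fraction Ω/4π (isotropic emission), and compares it with an EELS collection efficiency taken from the range quoted above. The function names are hypothetical, and detector quantum efficiency and absorption are ignored.

```python
import math

def geometric_fraction(solid_angle_sr):
    """Fraction of isotropically emitted x-rays entering the detector."""
    return solid_angle_sr / (4.0 * math.pi)

def xray_detected_fraction(fluorescence_yield, solid_angle_sr):
    """Detected x-rays per inner-shell ionization (detector losses ignored)."""
    return fluorescence_yield * geometric_fraction(solid_angle_sr)

print(f"0.13 sr collects {geometric_fraction(0.13):.1%} of emitted x-rays")
print(f"0.9 sr collects {geometric_fraction(0.9):.1%} of emitted x-rays")

# Light element (fluorescence yield taken as 2%), 0.13-sr detector,
# compared with the lower end of the EELS collection range (20%).
xray = xray_detected_fraction(fluorescence_yield=0.02, solid_angle_sr=0.13)
eels = 0.20
print(f"EELS / EDX detected-signal ratio: about {eels / xray:.0f}")
```

For a light element the ratio comes out near three orders of magnitude in favor of EELS, before allowing for the higher EELS background discussed in the following paragraphs.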


Table 1.2. Comparison of EELS with windowless EDX spectroscopy of a TEM specimen 


Advantages of EELS                       Disadvantages of EELS

Higher core-loss signal                  Higher spectral background
Higher ultimate spatial resolution       Very thin specimen needed
Absolute, standardless quantification    Possible inaccuracy in crystals
Structural information available         More operator intensive

Fig. 1.13 X-ray fluorescence yield for K-, L-, and M-shells, as a function of atomic number, from
Krause (1979) 


Unfortunately, the EELS background, arising from inelastic scattering by all 
atomic electrons whose binding energy is less than the edge energy, is generally 
higher than the EDX background, which arises from stray radiation in the TEM 
column and bremsstrahlung production. In addition, the characteristic features in 
an energy-loss spectrum are not peaks but edges; the core-loss intensity is spread 
over an extended energy range beyond the edge, making it less visible than the cor- 
responding peak in the x-ray spectrum. It is possible to define a signal/noise ratio 
(SNR) that takes account of the edge shape, and the minimum detectable concen- 
tration of an element can be shown to depend on SNR, not signal/background ratio 
(Section 5.6.3). On this basis, Leapman and Hunt (1991) compared the sensitivity 
of EELS and EDX spectroscopy and showed EELS capable of detecting smaller 
concentrations of elements of low atomic number; see Section 5.5.4. Using a field- 
emission STEM and parallel-recording EELS, Leapman and Newbury (1993) could 
detect concentrations down to 10 ppm for transition metals and lanthanides in pow- 
dered glass samples. Shuman et al. (1984) reported a sensitivity of 20 ppm for Ca 
in organic test specimens. In some specimens, the detection limit is determined by 
radiation damage and in this regard EELS is generally preferable to EDX spec- 
troscopy because a larger fraction of the inner-shell excitations can be recorded by 
the spectrometer. 

EELS also offers slightly better spatial resolution than x-ray emission spec- 
troscopy because the volume of specimen giving rise to the energy-loss signal can be 
limited by means of an angle-limiting aperture, as shown in Fig. 1.12. The effects of 
beam broadening (due to elastic scattering) and beam tails (if spherical aberration 
of the probe-forming lens is not corrected) should therefore be less, as confirmed 
experimentally (Collett et al., 1984; Titchmarsh, 1989; Geng et al., 2009). The 
energy-loss signal is also unaffected by absorption, secondary fluorescence, and 


Fig. 1.14 (a) HAADF-STEM image of a [100]-projected GaAs specimen, showing bright Ga and
As atomic columns. (b) EELS image showing the Ga-L23 intensity in green and As-L23 intensity
in red. (c) EDX spectroscopy image with Ga-Kα intensity in green and As-Kα intensity in red.
Courtesy of M. Watanabe 


the generation of fast secondary electrons within the specimen, making quantifi- 
cation potentially more straightforward. With an aberration-corrected STEM and 
efficient detectors, elemental maps showing individual atomic columns in a crystal 
are now feasible, using either energy loss or EDX spectroscopy (Watanabe et al., 
2010a, b). As seen in Fig. 1.14, atomic columns are visible in the EDX image, but 
the EELS map shows superior signal/noise ratio because of the larger number of 
core excitations recorded. 

Spatial resolution is a major factor determining the minimum detectable mass of 
an element. Krivanek et al. (1991a, b) used a 1-nm probe with about 1 nA current to
identify clusters of one or two thorium atoms (on a thin carbon film) from the O45 
edge. More recently, Varela et al. (2004) reported the detection of single La atoms 
inside a thin specimen of CaTiO3, using EELS and an aberration-corrected STEM. 
The only alternative technique capable of single-atom identification is the field ion 
atom probe (Section 1.4.1), whose applications have been limited by problems of 
specimen preparation. However, focused ion beam (FIB) techniques ease the prepa- 
ration of sharp needle-shaped tips, and the development of the local electrode atom 
probe (LEAP) makes this technique a powerful competitor to EELS. 


1.5.2 Specimen Requirements 


If the specimen is too thick, plural scattering greatly increases the background to 
ionization edges below 1000 eV, making these edges invisible for specimens thicker 
than 100 or even 50 nm. This requirement places stringent demands on specimen 
preparation, which can sometimes be met by ion milling (of inorganic materials) or 
ultramicrotome preparation of ultrathin sections; see Section 1.6. The situation is 
eased somewhat by the use of higher accelerating voltage, although in many materi- 
als this introduces knock-on radiation damage by bulk- and surface-displacement 
processes. EDX spectroscopy can tolerate thicker specimens (up to a few hun- 
dred nanometers), although absorption corrections for light element quantification 
become severe in specimens thicker than 100 nm. 
1.5.3 Accuracy of Quantification


In EELS, the signal-intensity ratios depend only on the physics of the primary 
excitation and are largely independent of the spectrometer. Quantification need not 
involve the use of standards; measured core-loss intensities can be converted to ele- 
mental ratios using cross sections that are calculated for the collection angle, range 
of energy loss, and incident electron energy employed in the analysis. These cross 
sections are known to within 5% for most K-edges and 15% for most L-edges, the 
accuracy for other edges being highly variable (Egerton, 1993). EELS analysis of 
45-nm NiO films distributed to four laboratories yielded elemental ratios within 10% 
of stoichiometry (Bennett and Egerton, 1995); analysis of small areas of less-ideal 
specimens would give more variable results. 
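
As an illustration of standardless quantification, the sketch below converts two measured core-loss intensities into an elemental ratio using the standard relation N_A/N_B = (I_A/I_B)(σ_B/σ_A), where each intensity and partial cross section refers to the same collection angle and energy window. The function name and the numerical values are hypothetical, chosen only to make the arithmetic easy to follow; they are not measured data.

```python
def elemental_ratio(intensity_a: float, intensity_b: float,
                    sigma_a: float, sigma_b: float) -> float:
    """Return the atomic ratio N_A / N_B from background-subtracted core-loss
    intensities and the matching partial cross sections (same units for both)."""
    if min(intensity_a, intensity_b, sigma_a, sigma_b) <= 0:
        raise ValueError("intensities and cross sections must be positive")
    return (intensity_a / intensity_b) * (sigma_b / sigma_a)


# Hypothetical example: edge A gives four times the counts of edge B, but its
# partial cross section is also four times larger, so the atomic ratio is 1:1.
ratio = elemental_ratio(intensity_a=12000.0, intensity_b=3000.0,
                        sigma_a=4.0e-25, sigma_b=1.0e-25)
print(f"N_A / N_B = {ratio:.2f}")
```

The spectrometer does not appear in this expression, which is the sense in which the ratio depends only on the physics of the primary excitation; the uncertainty in the result is dominated by the 5-15% uncertainty in the cross sections.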

In contrast, the relative intensities of EDX peaks depend on the properties of 
the detector. For thin specimens, this problem is addressed within the k-factor and 
ζ-factor methods, but because detector parameters are not precisely known, these
k-factors cannot be calculated with high accuracy. For the same reason, k-factors 
measured in other instruments serve only as a rough guide. To achieve an accuracy 
of better than 15%, the appropriate k-factors have usually been measured for each 
analyzed element, using test specimens of known composition and the same x-ray 
detector and microscope, operating at the same accelerating voltage (Williams and 
Carter, 2009). 
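
The k-factor procedure described above can be sketched as two steps, assuming the usual thin-specimen (Cliff-Lorimer) relation C_A/C_B = k_AB (I_A/I_B): the k-factor is first measured from a test specimen of known composition and then applied to the unknown. The function names and all numbers below are hypothetical illustrations, not values from the text.

```python
def k_factor_from_standard(conc_a: float, conc_b: float,
                           intensity_a: float, intensity_b: float) -> float:
    """Calibrate k_AB from a standard of known concentration ratio,
    using peak intensities recorded with the same detector and voltage."""
    return (conc_a / conc_b) / (intensity_a / intensity_b)


def concentration_ratio(k_ab: float, intensity_a: float,
                        intensity_b: float) -> float:
    """Apply a measured k-factor to the peak intensities of an unknown."""
    return k_ab * intensity_a / intensity_b


# Hypothetical standard with equal concentrations of A and B.
k_ab = k_factor_from_standard(conc_a=0.5, conc_b=0.5,
                              intensity_a=1000.0, intensity_b=800.0)
# Hypothetical unknown measured under the same conditions.
unknown = concentration_ratio(k_ab, intensity_a=500.0, intensity_b=1000.0)
print(f"k_AB = {k_ab:.2f}, C_A/C_B = {unknown:.2f}")
```

Because k_AB absorbs the detector response, the calibration is valid only for the instrument, detector, and accelerating voltage used to measure it.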

In the case of low-energy x-rays (e.g., K-peaks from light elements) the k-factors 
are dependent on x-ray absorption in the protective window and front end of the 
detector, and within the specimen itself. While it is possible to correct for such 
absorption, the accuracy of the correction is dependent on the specimen geometry, 
making the accuracy of light element EDX analysis generally worse than for heavier 
elements. 

Sometimes an overlap of peaks can prevent meaningful EDX analysis, as in the 
case of light element quantification using K-peaks when there are heavier elements 
whose L-peaks occur within 100 eV; see Fig. 1.11a. In the energy-loss spectrum, 
these edges overlap but are more easily distinguished because of the better energy 
resolution, close to 1 eV rather than 100 eV. Problems of background subtraction
(e.g., at the Cr edge in Fig. 1.11b) can often be overcome by fitting the energy-loss 
spectrum to reference standards. 


1.5.4 Ease of Use and Information Content 


Changing from TEM or STEM imaging to recording an EDX spectrum typically
involves positioning the incident beam, ensuring that probe current is not excessive 
(to avoid detector saturation), withdrawing any objective aperture, and inserting the 
EDX detector. Once set up, the detector and electronics require little maintenance, 
especially for detectors that do not require liquid-nitrogen cooling. EDX software 
has been developed to the point where elemental ratios are predicted in a routine 
fashion. Problems of peak overlap, absorption, and fluorescence can be important, 
but not always. For these reasons, EDX spectroscopy remains the technique of
choice for most TEM/STEM elemental analysis. 

Obtaining an energy-loss spectrum involves adjusting the spectrometer excitation 
(positioning the zero-loss peak), choosing an energy dispersion and collection angle, 
and verifying that the specimen is suitably thin. For some measurements, low spec- 
trum drift is important and this condition may involve waiting for the microscope 
high voltage and spectrometer power supplies to stabilize. Although improvements 
in software have made spectral analysis more convenient, the success of basic opera- 
tions such as the subtraction of instrumental and pre-edge backgrounds still depends 
on the skill of the operator and some understanding of the physics involved. 

To summarize, EELS is a more demanding technique than EDX spectroscopy 
in terms of the equipment, expertise, and knowledge required. In return for this 
investment, energy-loss spectroscopy offers greater elemental sensitivity for certain 
specimens and the possibility of additional information, including an estimate of the 
local thickness of a TEM specimen and information about its crystallographic and 
electronic structure. In fact, EELS provides data similar to x-ray, ultraviolet, visible, 
and (potentially) infrared spectroscopy, all carried out in the same instrument and 
with the possibility of atomic-scale spatial resolution (Brown, 1997). Obtaining this 
information is the subject of the remainder of this book; practical applications and 
limitations are discussed in Chapter 5. 


1.6 Further Reading 


The following chapters assume some familiarity with the operation of a transmis- 
sion electron microscope, a topic covered in many books, of which the 800-page 
Williams and Carter (2009) is the most readable and comprehensive. Reimer and 
Kohl (2008) give a thorough account of the physics and electron optics involved; 
Egerton (2005) treats those topics at an introductory level. Hirsch et al. (1977) 
remains a useful guide to diffraction-contrast imaging, while the text by De Graef 
(2003) provides a more modern account of electron diffraction and crystallography. 
Phase-contrast imaging is dealt with by Spence (2009) and by Buseck et al. (1988); 
reflection imaging and diffraction are reviewed by Wang (1993, 1996). 

The correction of electron lens aberrations has increased the performance of elec- 
tron microscopes and some of the implications are described in Hawkes (2008). 
A modern discussion of STEM techniques and achievements is contained in the 
book edited by Nellist and Pennycook (2011). Progress in time-resolved (fem- 
tosecond) TEM and EELS is well illustrated in Zewail and Thomas (2010). 
Spence (2005) provides a lucid introduction to diffractive (lensless) imaging and 
Chapman (2009) describes femtosecond diffractive imaging with a free-electron
laser. 

Analytical electron microscopy is treated in volume 4 of Williams and Carter 
(2009) and in several multi-author volumes: Hren et al. (1979), Joy et al. (1986), 
and Lyman et al. (1990). A detailed review of alternative analytical techniques, 
based on lectures from the 40th Scottish Universities Summer School in Physics 
was published in book form (Fitzgerald et al., 1992) but the two-volume Science of
Microscopy edited by Hawkes and Spence (2008) is more recent and comprehensive. 

TEM-EELS is outlined in the microscopy handbook of Brydson (2001). Recent 
review articles dealing with basic principles of EELS include Colliex (2004), Spence 
(2006), Egerton (2009), and Garcia de Abajo (2010), the last from a more theoretical
point of view. Materials science applications are described in some detail in
Disko et al. (1992); this multi-author book was revised and edited by Ahn (2004), 
and now contains a digital CD version of the EELS Atlas (Ahn and Krivanek, 1983), 
giving low-loss and core-loss spectra of many solid elements and some compounds. 

Early progress in EELS is reviewed by Colliex (1984) and Marton et al. (1955). 
Other review articles of historical and general interest include Silcox (1979), Joy 
(1979), Joy and Maher (1980c), Isaacson (1981), Leapman (1984), Zaluzec (1988), 
Egerton (1992b), and Colliex et al. (1976a). 

The physics and spectroscopy of outer-shell excitation is well covered by Raether 
(1965), Daniels et al. (1970), and Raether (1980). Basic theory of inelastic scattering 
is given in Schattschneider (1986); magnetic (linear and chiral-dichroic) measure- 
ments are described in Schattschneider (2011). The benefits of energy filtering in 
TEM, as well as an account of the instrumentation and physics involved, are fully 
covered in the multi-author volume edited by Reimer (1995). The effect of inelastic 
scattering on TEM images and diffraction patterns is treated in depth by Spence and 
Zuo (1992) and by Wang (1995). 

EELS has been the subject of several workshops, which are represented by col- 
lected papers in Ultramicroscopy: vol. 110, no. 8 (EDGE 2009, Banff); vol. 106, 
nos. 11 and 12 (EDGE 2005, Grundlsee), vol. 96, nos. 3 and 4 (SALSA 2002, 
Guadeloupe), vol. 59, July 1995 (EELSI 1994, Leukerbad), and vol. 28, nos. 1-4
(Aussois); also in Microscopy, Microanalysis, Microstructures: vol. 6, no. 1, Feb. 
1995 (Leukerbad) and vol. 2, nos. 2/3, April/June 1991 (Lake Tahoe). Workshops on 
electron spectroscopic imaging (ESI) are documented in Ultramicroscopy (vol. 32, 
no. 1, 1990: Tübingen), and Journal of Microscopy (April 1991: Dortmund; vol. 6,
Pt. 3, 1992: Munich). 

Several web sites contain information useful to researchers using EELS, 
including 


http://www.TEM-EELS.ca/ (includes teaching material and the software 
described in Appendix B) 

http://www.felmi-zfe.tugraz.at/ (includes a collection of Digital Micrograph 
scripts) 

http://www.gatan.com/software/ (Digital Micrograph scripting resources) 

http://www.public.asu.edu/~perkes/DMSUG.html (Digital Micrograph Scrip- 
ting Users Group) 

http://pc-web.cemes.fr/eelsdb/ (a database of energy-loss spectra)

http://people.ccmr.cornell.edu/~davidm/WEELS/ (includes spectral data)

http://unicorn.mcmaster.ca/corex/cedb-title.html (a database of core-loss spec- 
tra recorded from gases by Hitchcock and colleagues at McMaster 
University) 
The related field of transmission energy-loss spectroscopy of gases is described
by Bonham and Fink (1974) and Hitchcock (1989, 1994). For reflection-mode high- 
resolution spectroscopy (HREELS) at low incident electron energy, see Ibach and 
Mills (1982) and Kesmodel (2006). 

Specimen preparation is always important in transmission electron microscopy 
and is treated in depth for materials science specimens by Goodhew (1984). 
Chemical and electrochemical thinning of bulk materials is included in Hirsch et al. 
(1977). The book edited by Giannuzzi and Stevie (2005) deals with FIB methods. 
Ion milling is applicable to a wide range of materials and is especially useful for 
cross-sectional specimens (Bravman and Sinclair, 1984) but can result in changes in 
chemical composition within 10 nm of the surface (Ostyn and Carter, 1982; Howitt, 
1984). To avoid this, mechanical methods are attractive. They include ultramicro- 
tomy (Cook, 1971; Ball et al., 1984; Timsit et al., 1984; Tucker et al., 1985; see also 
papers in Microscopy Research and Technique, vol. 31, 1995, 265-310), cleavage 
techniques using tape or thermoplastic glue (Hines, 1975), small-angle cleavage 
(McCaffrey, 1993), and abrasion into small particles which are then dispersed on 
a support film (Moharir and Prakash, 1975; Reichelt et al., 1977; Baumeister and 
Hahn, 1976). Extraction replicas can be employed to remove precipitates lying close 
to a surface; plastic film is often used but other materials are more suitable for the 
extraction of carbides (Garratt-Reed, 1981; Chen et al., 1984; Duckworth et al., 
1984; Tatlock et al., 1984). 


Chapter 2 
Energy-Loss Instrumentation 


2.1 Energy-Analyzing and Energy-Selecting Systems 


Complete characterization of a specimen in terms of its inelastic scattering would 
involve recording the scattered intensity J(x, y, z, θx, θy, E) as a function of position
(coordinates x, y, z) within the specimen and as a function of scattering angle
(components θx and θy) and energy loss E. For an anisotropic crystalline specimen,
the procedure would have to be repeated at different specimen orientations. Even 
if technically feasible, such measurements would involve storing a vast amount of 
information, so in practice the acquisition of energy-loss data is restricted to the 
following categories (see Fig. 2.1): 


(a) An energy-loss spectrum J(E) recorded at a particular point on the specimen 
or (more precisely) integrated over a circular region defined by an incident 
electron beam or an area-selecting aperture. Such spectroscopy (also known as 
energy analysis) can be carried out using a conventional transmission electron 
microscope (CTEM) or a scanning transmission electron microscope (STEM) 
producing a stationary probe, either of them fitted with a double-focusing 
spectrometer such as the magnetic prism (Sections 2.1.1 and 2.2). 

(b) A line spectrum J(y, E) or J(θy, E), where distance perpendicular to the E-axis
represents a single coordinate in the image or diffraction pattern. This mode 
is obtained by using a spectrometer that focuses only in the direction of dis- 
persion, such as the Wien filter (Section 2.1.3). It can also be implemented by 
placing a slit close to the entrance of a double-focusing magnetic spectrometer, 
the slit axis corresponding to the nondispersive direction in the spectrome- 
ter image plane; this technique is sometimes called spatially resolved EELS 
(SREELS). 

(c) An energy-selected image J(x, y) or filtered diffraction pattern J(θx, θy) recorded
for a given energy loss E (or small range of energy loss) using CTEM or STEM 
techniques, as discussed in Section 2.6. 

(d) A spectrum image J(x, y, E) obtained by acquiring an energy-loss spectrum at 
each pixel as a STEM probe is rastered over the specimen (Jeanguillaume and 
Colliex, 1989). Using a conventional TEM fitted with an imaging filter, the same 
Fig. 2.1 Energy-loss data obtainable from (a) a fixed-beam TEM fitted with a double-focusing
spectrometer; (b) CTEM with line-focus spectrometer or double-focusing spectrometer with an
entrance slit; (c) CTEM operating with an imaging filter or STEM with a serial-recording
spectrometer; (d) STEM fitted with a parallel-recording spectrometer or CTEM collecting a series of
energy-filtered images


information can be obtained by recording a series of energy-filtered images at 
successive energy losses, sometimes called an image spectrum (Lavergne et al., 
1992). This corresponds in Fig. 2.1d to acquiring the information from succes- 
sive layers, rather than column by column as in the STEM method. By filtering 
a diffraction pattern or using a rocking beam (in STEM) it is also possible to 
record the J(θx, θy, E) data cube.
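
The relationship between categories (a) to (d) can be made concrete by treating the spectrum image as a three-dimensional array and extracting the other data types as slices of it. The following is a minimal sketch using plain Python lists and made-up counts; the function names are hypothetical, and in practice an array library would normally be used.

```python
def make_cube(ny: int, nx: int, n_energy: int):
    """Build a synthetic J(x, y, E) data cube, indexed as cube[y][x][e].
    The counts are invented: they fall linearly with energy channel."""
    return [[[(y + 1) * (x + 1) * (n_energy - e) for e in range(n_energy)]
             for x in range(nx)]
            for y in range(ny)]


def point_spectrum(cube, x: int, y: int):
    """Category (a): the energy-loss spectrum J(E) at one pixel."""
    return list(cube[y][x])


def line_spectrum(cube, x: int):
    """Category (b): J(y, E) along a line of constant x."""
    return [list(cube[y][x]) for y in range(len(cube))]


def energy_selected_image(cube, e_start: int, e_stop: int):
    """Category (c): image J(x, y) summed over channels e_start..e_stop-1."""
    return [[sum(spectrum[e_start:e_stop]) for spectrum in row] for row in cube]


cube = make_cube(ny=2, nx=3, n_energy=4)   # category (d): the full cube
print(point_spectrum(cube, x=0, y=0))
print(energy_selected_image(cube, 0, 2))
```

Acquiring the cube pixel by pixel (STEM) fills it one `cube[y][x]` column at a time, whereas a series of energy-filtered images (CTEM) fills it one energy layer at a time; the stored information is the same.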


Details of the operation of these energy-loss systems are discussed in 
Sections 2.3, 2.4, 2.5, and 2.6. In the next section, we review the kinds of spectrom- 
eter that have been used for TEM-EELS, where an incident energy of the order of 
10^5 eV is necessary to avoid excessive scattering in the specimen. Since an energy
resolution better than 1 eV is desirable, the choice of spectrometer is limited to
those types that offer high resolving power, which rules out techniques such as 
time-of-flight analysis that are used successfully in other branches of spectroscopy. 


2.1.1 The Magnetic Prism Spectrometer 


In the magnetic prism spectrometer, electrons traveling at a speed v in the z-direction 
are directed between the poles of an electromagnet whose magnetic field B is in the 
y-direction, perpendicular to the incident beam. Within this field, the electrons travel
in a circular orbit whose radius of curvature R is given by


$$R = \frac{\gamma m_0}{eB}\, v \qquad (2.1)$$


where $\gamma = 1/(1 - v^2/c^2)^{1/2}$ is a relativistic factor and $m_0$ is the rest mass of an electron.
The electron beam emerges from the magnet having been deflected through an
angle φ, often chosen to be 90° for convenience. As Eq. (2.1) indicates, the precise
angular deflection of an electron depends on its velocity v within the magnetic field.
Electrons that have lost energy in the specimen have a lower value of v and smaller
R, so they leave the magnet with a slightly larger deflection angle (Fig. 2.2a).
Besides introducing bending and dispersion, the magnetic prism also focuses an
electron beam. Electrons that originate from a point object O (a distance u from the
entrance of the magnet) and deviate from the central trajectory (the optic axis) by
some angle γx (measured in the radial direction) will be focused into an image Ix, a


Fig. 2.2. Focusing and dispersive properties of a magnetic prism. The coordinate system rotates
with the electron beam, so the x-axis always represents the radial direction and the z-axis is the
direction of motion of the central zero-loss trajectory (the optic axis). Radial focusing in the x-z
plane (the first principal section) is represented in (a); the trajectories of electrons that have lost
energy are indicated by dashed lines and the normal component Bn of the fringing field is shown
for the case y > 0. Axial focusing in the y-z plane (a flattened version of the second principal
section) is illustrated in (b)
distance vx from the exit of the magnet; see Fig. 2.2a. This focusing action occurs
because electrons with positive γx travel a longer distance within the magnetic field
and therefore undergo a larger angular deflection, so they return toward the optic
axis. Conversely, electrons with negative γx travel a shorter distance in the field, are
deflected less, and converge toward the same point Ix. To a first approximation, the
difference in path length is proportional to γx, giving first-order focusing in the x-z
plane. If the edges of the magnet are perpendicular to the entrance and exit beam
(ε1 = ε2 = 0), points O, Ix, and C (the center of curvature) lie in a straight line
(Barber’s rule); the prism is then properly referred to as a sector magnet and focuses
only in the x-direction. If the entrance and exit faces are tilted through positive angles
ε1 and ε2 (in the direction shown in Fig. 2.2a), the differences in path length are less
and the focusing power in the x-z plane is reduced.
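
To give a feeling for the magnitudes in Eq. (2.1), the sketch below evaluates the bend radius for a 100-keV electron and for an electron that has lost 100 eV. It assumes a uniform field of 0.01 T, a value chosen purely for illustration and not a specification of any real spectrometer; the function name is likewise hypothetical. Expressing the momentum as pc in electron volts lets the electron charge cancel, so that R = γβ(m0c²/e)/(cB).

```python
import math

ELECTRON_REST_ENERGY_EV = 510_998.95   # m0 c^2 expressed in eV
SPEED_OF_LIGHT = 299_792_458.0         # m/s


def bend_radius(kinetic_energy_ev: float, field_tesla: float) -> float:
    """Radius of curvature (m) from Eq. (2.1), R = gamma * m0 * v / (e * B)."""
    gamma = 1.0 + kinetic_energy_ev / ELECTRON_REST_ENERGY_EV
    beta = math.sqrt(1.0 - 1.0 / gamma**2)             # v / c
    pc_ev = gamma * beta * ELECTRON_REST_ENERGY_EV     # momentum times c, in eV
    return pc_ev / (SPEED_OF_LIGHT * field_tesla)


B_FIELD = 0.01  # tesla, an assumed illustrative value
r_zero_loss = bend_radius(100_000.0, B_FIELD)
r_after_loss = bend_radius(100_000.0 - 100.0, B_FIELD)
print(f"zero-loss radius: {r_zero_loss * 1e3:.2f} mm")
print(f"change for a 100-eV loss: {(r_zero_loss - r_after_loss) * 1e6:.1f} um")
```

The electron that has lost energy follows the tighter orbit and is therefore deflected through the larger angle, which is the origin of the dispersion shown in Fig. 2.2a.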

Focusing can also take place in the y-z plane (i.e., in the axial direction, parallel
to the magnetic field axis), but this requires a component of magnetic field in
the radial (x) direction. Unless a gradient field design is used (Crewe and Scaduto,
1982), such a component is absent within the interior of the magnet, but in the fringing
field at the polepiece edges there is a component of field Bn (for y ≠ 0) that is
normal to each polepiece edge (see Fig. 2.2a). Provided the edges are not perpendicular
to the optic axis (ε1 ≠ 0 ≠ ε2), Bn itself has a radial component Bx in
the x-direction, in addition to its component Bz along the optic axis. If ε1 and ε2
are positive (so that the wedge angle ω is less than the bend angle φ), Bx > 0 for
y > 0 and the magnetic forces at both the entrance and exit edges are in the negative
y-direction, returning the electron toward a point Iy on the optic axis. Each boundary
of the magnet therefore behaves like a convex lens for electrons traveling in the y-z
plane (Fig. 2.2b).

In general, the focusing powers in the x- and y-directions are unequal, so that
line foci Ix and Iy are formed at different distances vx and vy from the exit face; in
other words, the device exhibits axial astigmatism. For a particular combination of
ε1 and ε2, however, the focusing powers can be made equal and the spectrometer is
said to be double focusing. In the absence of aberrations, electrons originating from
O would all pass through a single point I, a distance vx = vy = v from the exit. A
double-focusing spectrometer therefore behaves like a normal lens; if an extended
object were placed in the x-y plane at point O, its image would be produced in the
x-y plane passing through I. But unlike the case of an axially symmetric lens, this
two-dimensional imaging occurs only for a single value of the object distance u. If u
is changed, a different combination of ε1 and ε2 is required to give double focusing.

Like any optical element, the magnetic prism suffers from aberrations. For 
example, aperture aberrations cause an axial point object to be broadened into an 
aberration figure (Castaing et al., 1967). For the straight-edged prism shown in 
Fig. 2.2a, these aberrations are predominantly second order; in other words, the 
dimensions of the aberration figure depend on γx² and γy². Fortunately, it is possible
to correct these aberrations by means of sextupole elements or by curving the edges 
of the magnet, as discussed in Section 2.2. 

For energy analysis in the electron microscope, a single magnetic prism is the 
most frequently used type of spectrometer. This popularity arises largely from 
the fact that it can be manufactured as a compact, add-on attachment to either
a conventional or a scanning transmission microscope without affecting its basic 
performance and operation. The spectrometer is not connected to the microscope 
high-voltage supply, so a magnetic prism can be used even for accelerating voltages 
exceeding 500 kV (Darlington and Sparrow, 1975; Perez et al., 1975). However,
good energy resolution demands a magnet current supply of very high stability 
and requires the high-voltage supply of the microscope to be equally stable. As 
the dispersive power is rather low, values of around 2 μm/eV being typical for
100-keV electrons, good energy resolution requires finely machined detector slits (for
serial acquisition) or post-spectrometer magnifying optics (in the case of a parallel 
recording detector). On the other hand, the dispersion is fairly linear over a range of 
2000 eV, making the magnetic prism well suited to parallel recording of inner-shell 
losses. 
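
The practical consequence of a dispersion of about 2 μm/eV can be seen from simple arithmetic, sketched below. The slit width and spectrum length follow directly from the figures quoted above; the detector pixel pitch (14 μm) and channel width (0.5 eV) are assumed values used only to illustrate why magnifying optics are needed, and the function names are hypothetical.

```python
DISPERSION_M_PER_EV = 2e-6  # about 2 um/eV, typical for 100-keV electrons


def slit_width_for_resolution(delta_e_ev: float,
                              dispersion: float = DISPERSION_M_PER_EV) -> float:
    """Slit width (m) that passes an energy window of delta_e_ev."""
    return dispersion * delta_e_ev


def magnification_for_detector(channel_width_ev: float, pixel_pitch_m: float,
                               dispersion: float = DISPERSION_M_PER_EV) -> float:
    """Post-spectrometer magnification needed so that one detector pixel
    corresponds to channel_width_ev of energy loss."""
    return pixel_pitch_m / (dispersion * channel_width_ev)


slit = slit_width_for_resolution(1.0)                   # 1-eV resolution
length = slit_width_for_resolution(2000.0)              # span of a 2000-eV range
mag = magnification_for_detector(0.5, pixel_pitch_m=14e-6)  # assumed detector
print(f"slit width: {slit * 1e6:.1f} um")
print(f"2000-eV spectrum length: {length * 1e3:.1f} mm")
print(f"required magnification: {mag:.0f}x")
```

A slit only a few micrometers wide explains the need for fine machining in serial acquisition, while the assumed detector would need the spectrum magnified by roughly an order of magnitude before recording.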


2.1.2 Energy-Filtering Magnetic Prism Systems 


As discussed in Section 2.1.1, the edge angles of a magnetic prism can be chosen so 
that electrons coming from a point object will be imaged to a point on the exit side 
of the prism, for a given electron energy. In other words, there are points R1 and
R2 that are fully stigmatic and lie within a real object and image, respectively (see
Fig. 2.3a). Because of the dispersive properties of the prism, the plane through R2
will contain the object intensity convolved with the electron energy-loss spectrum 
of the specimen. Electron optical theory (Castaing et al., 1967; Metherell, 1971) 
indicates that, for the same prism geometry, there exists a second pair of stigmatic 
points V1 and V2 (Fig. 2.3b) that usually lie within the prism and correspond to
virtual image points. Electrons that are focused so as to converge on V1 appear
(after deflection by the prism) to emanate from V2. If an electron lens were used to
produce an image of the specimen at the plane passing through V1, a second lens
focused on V2 could project a real image of the specimen from the electrons that 
have passed through the prism. An aperture or slit placed at R2 would transmit only 
those electrons whose energy loss lies within a certain range, so the final image 
would be an energy-filtered (or energy-selected) image. 

Ideally, the image at V2 should be achromatic (see Fig. 2.3), a condition that can 
be arranged by suitable choice of the object distance (location of point R1) and prism
geometry. In that case, the prism introduces no additional chromatic aberration, 
regardless of the width of the energy-selecting slit. 

In order to limit the angular divergence of the rays at R1 (so that spectrometer
aberrations do not degrade the energy resolution) while at the same time ensur- 
ing a reasonable field of view at the specimen, the prism is ideally located in the 
middle of a CTEM column, between the objective and projector lenses. A single 
magnetic prism is then at a disadvantage: it bends the electron beam through a large 
angle, so the mechanical stability of a vertical lens column would be lost. In pre- 
ference, a multiple deflection system is used, such that the net angle of deflection 
is zero. 
2.1.2.1 Prism-Mirror Filter


A filtering device first developed at the University of Paris (Castaing and Henry, 
1962) consists of a uniform field magnetic prism and an electrostatic mirror. 
Electrons are deflected through 90° by the prism, emerge in a horizontal direction, 
and are reflected through 180° by the mirror so that they enter the magnetic field 
a second time. Because their velocity is now reversed, the electrons are deflected 
downward and emerge from the prism traveling in their original direction, along the 
vertical axis of the microscope. 

In Fig. 2.3, R2 and V2 act as real and virtual objects for the second magnetic
deflection in the lower half of the prism, producing real and virtual images R3 and
V3, respectively. To achieve this, the mirror must be located such that its apex is at
R2 and its center of curvature is at V2, electrons being reflected from the mirror back
toward V2. Provided the prism itself is symmetrical (ε1 = ε3 in Fig. 2.3), the virtual
image V3 will be achromatic and at the same distance from the midplane as V1
(Castaing et al., 1967; Metherell, 1971). In practice, the electrostatic mirror consists 
of a planar and an annular electrode, both biased some hundreds of volts negative 
with respect to the gun potential of the microscope. The apex of the mirror depends 


Fig. 2.3. Imaging properties of a magnetic prism, showing (a) real image points (R1, R2, and
R3) and (b) virtual image points (V1, V2, and V3) and an achromatic point O. In this example,
electrons are reflected back through the prism by an electrostatic mirror, whose apex and curvature
are adjusted by two bias voltages.
on the bias applied to both electrodes; the curvature can be adjusted by varying the
voltage difference between the two (Henkelman and Ottensmeyer, 1974b). 

Although not essential (Castaing, 1975), the position of R1 can be chosen (for
a symmetric prism, ε1 = ε3) such that the point focus at R3 is located at the same
distance as R1 from the midplane of the system. Since the real image at R2 is chromatic
(as discussed earlier) and since the dispersion is additive during the second
passage through the prism, the image at R3 is also chromatic; if R1 is a point object,
R3 contains an energy-loss spectrum of the sample and an energy-selecting aperture 
or slit placed at R3 will define the range of energy loss contributing to the image V3. 
The latter is converted into a real image by the intermediate and projector lenses of 
the TEM column. Because V3 is achromatic, the resolution in the final image is (to 
first order) independent of the width of the energy-selecting slit, which ensures that 
the range of energy loss can be made sufficiently large to give good image intensity 
and that (if desired) the energy-selecting aperture can be withdrawn to produce an 
unfiltered image. 

In the usual mode of operation of an energy-selecting TEM, a low-excitation 
“post-objective” lens forms a magnified image of the specimen at V1 and a demagnified
image of the back-focal plane of the objective lens at R1. In other words, the
object at R1 is a portion (selected by the objective aperture) of the diffraction pattern
of the specimen: the central part, for bright-field imaging. With suitable operation 
of the lens column, the location of the specimen image and diffraction pattern can 
be reversed, so that energy-filtered diffraction patterns are obtained (Henry et al., 
1969; Egerton et al., 1975; Egle et al., 1984). 

In addition, the intermediate lens excitation can be changed so that the intensity 
distribution at R3 is projected onto the TEM screen. The energy-loss spectrum can 
then be recorded in a parallel mode (using a CCD camera) or serially (by scanning 
the spectrum past an aperture and electron detection system). If the system is slightly 
misaligned, a line spectrum is produced (Henkelman and Ottensmeyer, 1974a; 
Egerton et al., 1975) rather than a series of points, which is more convenient because the
lower current density results in less risk of damage in a scintillator or contamination 
on energy-selecting slits. 

The original Castaing—Henry system was improved by curving the prism edges 
to reduce second-order aberrations (Andrew et al., 1978; Jiang and Ottensmeyer, 
1993), allowing a greater angular divergence at R1 and therefore a larger field of
view in the energy-filtered image, for a given energy resolution. Although the mir- 
ror potential is tied to the microscope high voltage, the dispersion of the system 
arises entirely from the magnetic field. Therefore, good energy resolution is depen- 
dent upon stability of the high-voltage supply, unlike electrostatic or retarding field 
analyzers. 


2.1.2.2 Omega, Alpha, and MANDOLINE Filters 


A more recent approach to energy filtering in a CTEM takes the form of a purely 
magnetic device known as the omega filter. After passing through the specimen, the 
objective lens, and a low-excitation post-objective lens, the electrons pass through 
a magnetic prism and are deflected through an angle φ, typically 90-120°. They 
then enter a second prism whose magnetic field is in the opposite direction, so the 
beam is deflected downward. A further two prisms are located symmetrically with 
respect to the first pair and the complete trajectory takes the form of the Greek letter 
Ω (Fig. 2.4). The beam emerges from the device along its original axis, allowing 
vertical alignment of the lens column to be preserved. The dispersion within each 
magnetic prism is additive and an energy-loss spectrum is formed at a position D2 
that is conjugate with the object point D1; see Fig. 2.4. For energy-filtered imaging, 
D1 and D2 contain diffraction patterns while planes O1 and O3 (located just outside 
or inside the first and last prisms) contain real or virtual images of the specimen. An 
energy-selected image of the specimen is produced by using an energy-selecting slit 
in plane D2 and a second intermediate lens to image O3 onto the TEM screen (via 
the projector lens). Alternatively, the intermediate lens can be focused on D2 in order 
to record the energy-loss spectrum. If the post-objective lens creates an image and 
diffraction pattern at D1 and O1, respectively, an energy-filtered diffraction pattern 
occurs at O3. 

Fig. 2.4 Optics of an aberration-corrected omega filter (Pejas and Rose, 1978). 
Achromatic images of the specimen are formed at O1, O2, and O3; the plane through 
D2 contains an energy-dispersed diffraction pattern. Sextupole lenses (S) placed 
close to D1, O2, and D2 correct for image-plane tilt 

As a result of the symmetry of the omega filter about its midplane, second-order 
aperture aberration and second-order distortion vanish if the system is properly 
aligned (Rose and Plies, 1974; Krahl et al., 1978; Zanchi et al., 1977b). The remain- 
ing second-order aberrations can be compensated by curving the polefaces of the 
second and third prisms, using sextupole coils symmetrically placed about the mid- 
plane (Fig. 2.4) and operating the system with line (instead of point) foci between 
the prisms (Pejas and Rose, 1978; Krahl et al., 1978; Lanio, 1986). 

Unlike the prism-mirror system, the omega filter does not require connection to 
the microscope accelerating voltage. As a result, it has become a preferred choice 
for an energy-filtering TEM that employs an accelerating voltage above 100 kV 
(Zanchi et al., 1977a, 1982; Tsuno et al., 1997). Since a magnetic field of the same 
strength and polarity is used in the second and third prisms, these two can be com- 
bined into one (Zanchi et al., 1975), although this design does not allow a sextupole 
at the midplane. 

Another kind of all-magnetic energy filter consists of two magnets whose field 
is in the same direction but of different strength; the electrons execute a trajectory 
in the form of the Greek letter α (see later, Fig. 2.6b). An analysis of the first-order 
imaging properties of the alpha filter is given by Perez et al. (1984). 

In the MANDOLINE filter (Uhlemann and Rose, 1994) the first and last prisms 
of the omega design are combined into one and multipole correction elements are 
incorporated between all three prisms. This filter provides relatively high dispersion 
(6 μm/eV at 200 kV) and has been used as a high-transmissivity imaging filter in 
the Zeiss SESAM monochromated TEM. 


2.1.3 The Wien Filter 


A dispersive device employing both magnetic and electrostatic fields was reported in 
1897 by W. Wien and first used with high-energy electrons by Boersch et al. (1962). 
The magnetic field (induction $B$ in the y-direction) and electric field (strength $E$, 
parallel to the x-axis) are both perpendicular to the entrance beam (the z-axis). 
The polarities of these fields are such that the magnetic and electrostatic forces 
on an electron are in opposite directions; their relative strengths obey the 
relationship $E = v_1 B$ such that an electron moving parallel to the optic axis with speed 
$v_1$ and kinetic energy $E_1$ continues in a straight line, the net force on it being 
zero. Electrons traveling at some angle to the optic axis or with velocities other 
than $v_1$ execute a cycloidal motion (Fig. 2.5) whose rotational component is at the 
cyclotron frequency $\omega = eB/(\gamma m_0)$, where $e$ and $m_0$ are the electron charge and rest 
mass and $\gamma m_0$ is the relativistic mass. Electrons starting from a point (z = 0) on 
the optic axis and initially traveling in the xz plane return to the z-axis after one 
complete revolution; in other words, an achromatic focus occurs at z = 2L, where 
$L = \pi v_1/\omega = \pi\gamma m_0 E/(eB^2)$. In addition, an inverted chromatic image of unit 
magnification ($M_x = -1$) is formed at z = L (i.e., after half a revolution), its energy 
dispersion being $L/(\pi E_1)$ (Curtis and Silcox, 1971). Velocity components along the 
y-axis (magnetic field direction) are unaffected by the magnetic and electrostatic 
fields, so both the chromatic and achromatic images are actually line foci. 

Fig. 2.5 Wien filter spectrometer for a scanning-transmission microscope (Batson, 1985). Electron 
trajectories are shown (a) in the dispersive (x-z) plane and (b) in the nondispersive (y-z) plane. 
Dashed lines represent electrons that have lost energy in the specimen. A quadrupole lens has been 
added to make the system approximately double focusing 

The Wien filter is generally used with decelerated electrons. In other words, the 
filter is operated at a potential $-V_0 + V_1$ which is close to the negative potential 
$-V_0$ of the electron source. The positive bias $V_1$ is obtained from a power supply 
connected to the high-voltage line; its value, typically in the range 100-1000 V, 
determines the energy ($eV_1$) of the electrons that can move in a straight line through 
the filter. The retarding and accelerating fields at the entrance and exit of the filter 
act as electrostatic lenses (Fig. 2.5), whose effect must be taken into account in the 
design of the system. 
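
As a rough numerical illustration of the Wien filter relations ($E = v_1 B$, $\omega = eB/(\gamma m_0)$, $L = \pi v_1/\omega$, and a dispersion of $L/(\pi E_1)$ at the chromatic focus), the following Python sketch evaluates them for decelerated electrons. The function name `wien_filter` and the example values (100 eV pass energy, 2 mT induction) are assumptions chosen for illustration; they do not describe any particular instrument mentioned in the text.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, C
M0 = 9.1093837015e-31        # electron rest mass, kg
C_LIGHT = 299792458.0        # speed of light, m/s


def wien_filter(pass_energy_ev, b_field_t):
    """Return (E-field in V/m, half-length L in m, dispersion in m/eV).

    pass_energy_ev is the kinetic energy E1 of electrons that travel
    straight through the filter; b_field_t is the induction B.
    """
    rest_energy_ev = M0 * C_LIGHT**2 / E_CHARGE
    gamma = 1.0 + pass_energy_ev / rest_energy_ev
    v1 = C_LIGHT * math.sqrt(1.0 - 1.0 / gamma**2)   # speed for energy E1
    e_field = v1 * b_field_t                         # balance condition E = v1*B
    omega = E_CHARGE * b_field_t / (gamma * M0)      # cyclotron frequency
    half_length = math.pi * v1 / omega               # chromatic focus at z = L
    dispersion = half_length / (math.pi * pass_energy_ev)
    return e_field, half_length, dispersion


if __name__ == "__main__":
    e_field, half_length, dispersion = wien_filter(100.0, 2.0e-3)
    print(f"E = {e_field:.3e} V/m, L = {half_length * 1e3:.1f} mm, "
          f"dispersion = {dispersion * 1e6:.0f} um/eV")
```

With these assumed values the chromatic focus lies roughly 53 mm from the entrance and the dispersion is of the order of 100 μm/eV. Because the dispersion scales as $1/E_1$ for a fixed length, lowering the pass energy is what raises it.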

Although retardation involves the inconvenience of handling high voltages, it 
provides several advantages. First of all, the dispersion at the chromatic focus is 
increased by a factor $V_0/V_1$ for a given length L of the filter; values of 100 μm/eV or 
more are typical. The electrostatic lens at the exit of the filter can be used to project 
the spectrum onto the detection plane, with either a decrease or a further increase in 
the dispersion, depending on the distance of the final image. Second, the required 
magnitudes and stabilities of B and E are reduced and the mechanical tolerances of 
the polepieces and electrodes are relaxed. Third, because the electron velocity for 
straight-line transmission depends on $V_1$ rather than $V_0$, fluctuations and drift in $V_0$ 
do not affect the energy resolution. This factor is particularly important where high 
resolution must be combined with long recording times, for example, when record- 
ing inner-shell losses using a field-emission STEM (Batson, 1985). A Wien filter 
used in conjunction with a monochromator (Section 2.1.4) achieved an energy reso- 
lution of 5 meV for 30-keV electrons (Geiger et al., 1970), yielding spectra showing 
vibrational and phonon modes of energy loss (Fig. 1.9). These vibrational modes 
can be studied by infrared absorption spectroscopy but EELS offers the potential of 
much better spatial resolution. 

Because the system just discussed does not focus in the y-direction, the energy- 
loss spectrum is produced as a function of distance along a straight line in the 
entrance plane, this line being defined by an entrance slit. If a diffraction pattern 
(or a magnified image) of the specimen is projected onto the slit plane, using the 
lenses of a CTEM, the final image will contain a map of electron intensity as 
a function of both energy loss and scattering angle (or specimen coordinate). A 
two-dimensional sensor placed at the final-image plane can therefore record a large 
amount of information about the specimen (Batson and Silcox, 1983). 

The Wien filter can become double focusing if either E or B is made nonuni- 
form, for example, by curving the electric field electrodes, by tilting the magnetic 
polepieces to create a magnetic field gradient, or by shaping both the electric and 
magnetic fields to provide a quadrupole action (Andersen, 1967). The device is then 
suitable for use as an imaging filter in a fixed-beam TEM (Andersen and Kramer, 
1972). Aberrations of the filter can be corrected by introducing multipole elements 
(Andersen, 1967; Martinez and Tsuno, 2008). 


2.1.4 Electron Monochromators 


Besides being dependent on the spectrometer, the energy resolution of an energy 
analysis system is limited by energy spread in the electron beam incident on 
the specimen. If the electrons are produced by a thermionic source operated 
at a temperature $T_s$, the energies of the electrons leaving the cathode follow a 
Maxwellian distribution, whose full width at half maximum (FWHM) is $\Delta E_s = 
2.45\,kT_s$ (Reimer and Kohl, 2008). For a tungsten filament whose emission surface 
is at a temperature of 2800 K, $\Delta E_s$ = 0.6 eV; for a lanthanum hexaboride source at 
1700 K, $\Delta E_s$ = 0.3 eV. Values are lower for Schottky and field-emission sources. 
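
The quoted widths can be checked from the Maxwellian relation (FWHM = 2.45 kT) with a few lines of Python. This is a minimal sketch; the function name `thermionic_fwhm_ev` is a label introduced here, and the Boltzmann constant is the only physical input.

```python
K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def thermionic_fwhm_ev(temperature_k):
    """FWHM of the Maxwellian energy distribution, 2.45*k*T, in eV."""
    return 2.45 * K_B_EV * temperature_k


if __name__ == "__main__":
    for label, temperature in (("tungsten", 2800.0), ("LaB6", 1700.0)):
        print(f"{label}: {thermionic_fwhm_ev(temperature):.2f} eV")
```

The tungsten case gives about 0.59 eV and the lanthanum hexaboride case about 0.36 eV, consistent with the rounded figures in the text.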


2.1.4.1 The Boersch Effect 


The energy spread $\Delta E_0$ measured in an electron microscope is always larger than 
$\Delta E_s$, the discrepancy being referred to as the Boersch effect, since Boersch (1954) 
first investigated the dependence of the measured spread on physical parameters of 
the electron microscope: cathode temperature, Wehnelt electrode geometry, Wehnelt 
bias, accelerating voltage, vacuum conditions, and the deployment of magnetic and 
electrostatic lenses. He found that $\Delta E_0$ increases with the emission current and is 
further increased when the beam is focused into a crossover. Subsequent experi- 
mental work (Martin and Geissler, 1972; Ditchfield and Whelan, 1977; Bell and 
Swanson, 1979) confirmed these findings. 

When electrons are rapidly accelerated to an energy $E_0$, their energy spread $\delta E$ 
remains unaltered, in accordance with the conservation of energy. However, the 
axial velocity spread $\delta v_z$ is reduced as the axial velocity $v_z$ increases, since 
(nonrelativistically) $\delta E = \delta(m_0 v_z^2/2) = m_0 v_z\,\delta v_z$, giving $\delta v_z = \Delta E_s/(m_0 v_z)$. The equivalent axial beam 
temperature attained is $T_z = (kT_s/E_0)T_s$ (Knauer, 1979) and is very low (<0.1 K) 
for $E_0$ > 10 keV. If the electrons spend enough time in sufficiently close proximity 
to one another, so that they interact via Coulomb forces, the difference between the 
axial and transverse temperatures is reduced, raising $\delta v_z$ and increasing the 
measured energy spread. This is known as the “thermal” Boersch effect; the resulting 
value of $\Delta E$ depends on the path length of the electrons and on the current density 
(Knauer, 1979). 
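
To make the scale of this cooling concrete, the sketch below evaluates the axial temperature $(kT_s/E_0)T_s$ and the nonrelativistic axial velocity spread $\Delta E_s/(m_0 v_z)$. The function names and the example inputs (a 2800 K source with 0.6 eV spread, accelerated to 10 keV) are illustrative assumptions.

```python
import math

K_B_EV = 8.617333262e-5      # Boltzmann constant, eV/K
E_CHARGE = 1.602176634e-19   # elementary charge, C
M0 = 9.1093837015e-31        # electron rest mass, kg


def axial_beam_temperature(source_temp_k, beam_energy_ev):
    """Equivalent axial temperature T_z = (k*T_s / E_0) * T_s, in kelvin."""
    return (K_B_EV * source_temp_k / beam_energy_ev) * source_temp_k


def axial_velocity_spread(source_spread_ev, beam_energy_ev):
    """Nonrelativistic axial velocity spread dE_s / (m0 * v_z), in m/s."""
    v_z = math.sqrt(2.0 * beam_energy_ev * E_CHARGE / M0)
    return source_spread_ev * E_CHARGE / (M0 * v_z)


if __name__ == "__main__":
    print(f"T_z  = {axial_beam_temperature(2800.0, 1.0e4):.3f} K")
    print(f"dv_z = {axial_velocity_spread(0.6, 1.0e4):.0f} m/s")
```

For these inputs the axial temperature is below 0.1 K, in line with the statement above, while the transverse temperature remains of the order of the cathode temperature. That imbalance is what Coulomb interactions tend to relax.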

In addition, electrons that are focused into a crossover can suffer “collision 
broadening” through interaction between their transverse velocity components. The 
energy broadening depends on the current density at the crossover and on the diver- 
gence angle (Crewe, 1978; Knauer, 1979; Rose and Spehr, 1980). The beam current 
is highest within the electron gun, so appreciable broadening can occur at a gun 
crossover. A cold field-emission (CFEG) source provides the lowest energy spread 
(0.3 eV) at low emission currents (<10 nA) but $\Delta E_0$ increases to as much as 1 eV at 
100 nA emission, due to Coulomb interaction of electrons just outside the tip (Bell 
and Swanson, 1979). 


2.1.4.2 Types of Monochromator 


The Wien filter offers a high dispersion and good energy resolution (a few mil- 
lielectron volts) when operated with low-velocity electrons (Section 2.1.3). It can 
therefore be used to produce an incident beam of small energy width if an energy-selecting 
aperture is placed in an image of its chromatic focus (Boersch et al., 1962, 
1964). A second Wien filter (after the specimen) can act as an energy analyzer, 
making possible energy-loss spectroscopy of vibrational modes (Boersch et al., 
1962; Katterwe, 1972; Geiger, 1981). An energy resolution below 6 meV was 
eventually achieved at 30-keV incident energy. 

For analysis of small areas of a TEM specimen, a higher accelerating voltage 
is useful and a field-emission source maximizes the fraction of electrons allowed 
through the monochromator (this fraction is approximately the required resolution 
divided by the energy width of the source). A JEOL-1200EX instrument fitted with 
two retarding Wien filters and operated at 80 kV achieved an energy resolution 
of typically 80 meV (Terauchi et al., 1994) or down to 30 meV for energy losses 
below 5 eV. The microscope accelerating voltage was applied to the analyzing filter 
to decelerate the electrons, so high-voltage fluctuations were compensated in this 
design. 

An alternative strategy is to place the monochromator within the electron gun, 
where high dispersion is possible because the electrons have undergone only a lim- 
ited amount of acceleration. Mook and Kruit (2000) designed a short Wien filter 
for a high-resolution field-emission STEM (Batson et al., 2000, 2001). Its small 
length (4 mm) reduced the collision-broadening effect but resulted in a low energy 
dispersion ($D \approx 4$ μm/eV) for the 800-eV electrons passing through the filter, requiring 
a very narrow (200 nm) energy-selecting slit to achieve an energy spread in the 
50-100 meV range. 
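
The link between dispersion, slit width, and transmitted current can be sketched in a few lines. The helper names below are introduced for illustration; the transmitted fraction uses the approximation stated above (required resolution divided by source energy width), and the 0.3 eV source width is an assumed example value.

```python
def slit_width_m(dispersion_m_per_ev, energy_width_ev):
    """Slit width that passes a given energy width at a given dispersion."""
    return dispersion_m_per_ev * energy_width_ev


def transmitted_fraction(energy_width_ev, source_width_ev):
    """Approximate fraction of beam current passed by the slit (capped at 1)."""
    return min(1.0, energy_width_ev / source_width_ev)


if __name__ == "__main__":
    width = slit_width_m(4.0e-6, 0.05)          # 4 um/eV, 50 meV selected
    fraction = transmitted_fraction(0.05, 0.3)  # assumed 0.3 eV source width
    print(f"slit width = {width * 1e9:.0f} nm, fraction = {fraction:.2f}")
```

A dispersion of 4 μm/eV and a 50 meV selected width reproduce the 200 nm slit quoted above. The low transmitted fraction shows why a narrow-energy source is advantageous.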

For FEI microscopes, Tiemeijer et al. (2001) developed a longer (50 mm) gun 
monochromator. Electrons are dispersed and then accelerated before reaching the 
energy-selecting slit, making the latter less sensitive to electrostatic charging. The 
system has demonstrated a resolution down to 100 meV at low beam current. 

The design of Tsuno (2000) is a double Wien filter in which the second half of the 
filter compensates for energy broadening within the energy-selecting slit. This pro- 
cedure increases the electron-optical brightness by typically a factor of 3 compared 
to the single Wien filter design. Since electrons are removed by the energy-selecting 
slit, the beam current is still reduced, by a factor about equal to the improvement in 
energy resolution. Tsuno et al. (2005) have reported that a Wien filter is capable of 
acting as both a monochromator and a lens aberration corrector. 

The monochromator design of Rose (1990) is an electrostatic version of the 
omega filter. An energy-selecting slit is placed at its mid-plane (equivalent to O2 in 
Fig. 2.4) and the second half of the filter compensates for energy dispersion within 
the slit, optimizing the source brightness. This design was commercialized by CEOS 
GmbH and used in the Zeiss SESAM microscope, where it has demonstrated an 
energy resolution below 0.1 eV in conjunction with the MANDOLINE energy fil- 
ter. Advantages of the electrostatic design include low drift (absence of magnetic 
hysteresis effects) and avoidance of a high-stability current supply running at high 
potential. Disadvantages are its more complex electron optics and its fixed disper- 
sion: the width of the energy-selecting slit must be changed in order to vary the 
energy resolution. 

A gun monochromator does not compensate for instabilities of the high-voltage 
supply and the same is true for designs that use the microscope’s high voltage to 
decelerate the electrons inside the monochromator. Unless this same voltage supply 
is used to decelerate electrons within the spectrometer, superior design of the high- 
voltage supply is necessary to achieve 0.1-eV energy resolution (Tiemeijer et al., 
2001). Such resolution under practical conditions has also required improved spec- 
trometer design (Brink et al., 2003) and careful attention to minimizing magnetic 
fields in the microscope environment (Muller and Grazul, 2001). Slow drift can be 
compensated by storing multiple readouts of the spectrum and shifting them under 
computer control; even with no monochromator, this technique has yielded a spec- 
tral resolution of 0.3 eV (Kimoto and Matsui, 2002) and a precision of 0.1 eV for 
recording ionization edges (Potapov and Schryvers, 2004). 

Krivanek et al. (2009) designed a system employing two purely magnetic non- 
decelerating filters for the monochromator and analyzer, as shown in Fig. 2.6a. The 
monochromator is an alpha filter with an energy-selecting slit at its mid-point, the 
second half of the filter canceling the dispersion of the first half. It uses the same 
current as the analyzer, so that the latter tracks the energy selected by the monochro- 
mator, current drift having no effect on the selected energy. Current absorbed by the 
two halves of the energy-selecting slit is used as feedback to the high-voltage gen- 
erator so that the beam remains centered on the slit (Kruit and Shuman, 1985a). 
Quadrupoles placed before and after the monochromator slit magnify the dispersion 
from 0.3 μm/eV to about 200 μm/eV, so a 2-μm slit would select 10 meV 
energy width; see Fig. 2.6b. First- and second-order prism focusing is provided 
by quadrupole and sextupole lenses, rather than by tilt and curvature of the pole- 
piece edges. The sextupoles are also designed to correct for chromatic aberration 
of the probe-forming condenser lenses. For core-loss analysis, monochromation is 
less essential, so the alpha-filter field could be canceled by diverting the current into 
separate windings, allowing a straight-line path through the system but preserving 
the same heat dissipation (to avoid temperature change and mechanical drift). The 
energy resolution would then be ~0.3 eV, assuming a cold field-emission source 
running at 200 kV. 


2.1.4.3 Dispersion Compensation 


One disadvantage of a conventional monochromator system is that the monochro- 
mator reduces the beam current by a large factor if the energy spread of the electron 
source (including the Boersch effect) greatly exceeds the required energy resolution. 
Low beam current implies longer recording times. A remedy is to eliminate the 
energy-selecting slit of the monochromator and use a completely symmetrical system of 
monochromator and analyzer. 

Fig. 2.6 (a) Design of a monochromated TEM in which energy drift is minimized by using the 
same excitation current for both the monochromator and spectrometer and where intensity drift is 
minimized by feedback from the monochromator slit, so that the beam remains centered at the slit. 
(b) Design of the α-filter monochromator, incorporating three magnetic prisms, quadrupoles (Q), 
and sextupoles (S); electrons travel upward from a crossover provided by the condenser 1 lens. 
From Krivanek et al. (2009), courtesy of the Royal Society, London 

For example, if the length of a Wien filter is extended to a value $2L = 
2\pi\gamma m_0 E/(eB^2)$, an achromatic focus is formed at the exit plane, as discussed in 
Section 2.1.3. Because the final image is achromatic, its width is independent of 
the incident energy spread $\Delta E_0$, a result of the fact that the chromatic aberration of 
the second half of the system exactly compensates that of the first half. If a specimen 
is now introduced at the chromatic focus (z = L), the resulting energy losses provide 
an energy-loss spectrum at z = 2L but the width of each spectral line remains 
independent of the value of $\Delta E_0$. In addition, second-order aperture aberrations of 
the second half cancel those of the first, so the energy resolution of the system (if 
perfectly symmetrical) depends only on higher order aberrations and the object size 
of the electron source. In practice, the two halves of the double Wien filter can be 
separated by a short distance (to allow room for inserting the specimen) but great 
care has to be taken to keep the system symmetrical. An experimental prototype 
based on this principle (Andersen and Le Poole, 1970) achieved an energy resolu- 
tion of 50 meV (measured without a specimen) using 10-keV transmitted electrons. 
Scattering in the sample degraded this resolution to about 100 meV but a transmitted 
current of up to 0.1 μA was available. The system can be made double focusing at 
the chromatic image by using two independently excited field coils in each Wien fil- 
ter (Andersen, 1967), thereby reducing the y-spreading of the beam at the specimen. 
Some spreading in the x-direction is unavoidable, because of the energy spread of 
the electron source and the dispersion produced by the first Wien filter, and would 
limit the spatial resolution of analysis. 

The same principle (known as dispersion compensation or dispersion matching) 
has been employed in nuclear physics, the target being placed between a pair of 
magnetic sector spectrometers which bend a high-energy beam of electrons in the 
same direction (Schaerf and Scrimaglio, 1964). It has been applied to reflection 
spectroscopy of low-energy (e.g., 5 eV) electrons, using two identical spherical 
electrostatic sectors (Kevan and Dubois, 1984). 

Instead of a monochromator, it would be attractive to have a high-brightness 
electron source with low energy width. Fransen et al. (1999) examined the field 
emission properties of individual multiwall carbon nanotubes, mounted on the end 
of a tungsten wire. In ultrahigh vacuum, the emission was highly stable (less than 
10% variation in 50 days) even without “flashing” the tip. Energy widths in the 
range 0.11-0.2 eV were measured, the source brightness being roughly equivalent 
to that of a cold tungsten field emitter. Subsequent work has confirmed that high 
electron-optical brightness may be achievable from a carbon nanotube (De Jonge 
et al., 2002); a value of 6 x 10°A/cm*/sr! at 200 kV, together with good stability, 
has been reported (Houdellier et al., 2010). 


2.2 Optics of a Magnetic Prism Spectrometer 


As discussed in Section 2.1.1, a magnetic prism spectrometer produces three effects 
on a beam of electrons: bending, dispersion, and focusing. Focusing warrants the 
most attention, since the attainable energy resolution depends on the width of the 
exit beam at the dispersion plane. Provided the spatial distribution of the magnetic 
field is known, the behavior of an electron within the spectrometer can be predicted 
by applying equations of motion, based on Eq. (2.1), to each region of the trajectory. 
Details are given in Penner (1961), Brown et al. (1964), Brown (1967), and Enge 
(1967). The aim of this section is to summarize the results of such analysis and 
to provide an example of the use of a matrix computer program for spectrometer 
design. 

We will adopt the coordinate system and notation of Brown et al. (1964), widely 
used in nuclear physics. For a negative particle such as the electron, the y-axis is 
antiparallel to the direction of magnetic field. The z-axis always represents the direc- 
tion of motion of an electron traveling along the central trajectory (the optic axis); in 
other words, the coordinate system rotates about the y-axis as the electron proceeds 
through the magnetic field. The x-axis is perpendicular to the y- and z-axes and 
points radially outward, away from the center of curvature of the electron trajecto- 
ries. Using this curvilinear coordinate system, the y-axis focusing can be represented 
on a flat y-z plane (Fig. 2.2b). 

The behavior of electrons at the entrance and exit edges of the magnet is simpler 
to calculate if the magnetic field is assumed to remain constant up to the polepiece 
edges, dropping abruptly to zero outside the magnet. This assumption is known as 
the SCOFF (sharp cutoff fringing field) approximation and is more likely to be real- 
istic if the gap between the polepieces (measured in the y-direction) is very small. 
In practice, the field strength just inside the magnet is less than in the interior, and a 
fringing field extends some distance (of the order of the gap length) outside the geo- 
metrical boundaries. In the EFF (effective fringing field) model, the z-dependence 
of field strength adjacent to the magnet boundaries is specified by one or more 
coefficients, leading to a more accurate prediction of the spectrometer focusing. 

An important concept is the order of the focusing. Formation of an image is a 
first-order effect, so first-order theory is used to predict object and image distances, 
image magnifications, and dispersive power, the latter being first order in energy 
loss. Second- or higher order analysis is needed to describe image aberration and 
distortion, together with other properties such as the dispersion-plane tilt. 


2.2.1 First-Order Properties 


We first consider the “radial” focusing of electrons that originate from a point object 
O located a distance u from the entrance face of a magnetic prism (Fig. 2.2). For 
a particular value of u, all electrons that arrive at the center of the magnet (after 
deflection through an angle $\phi/2$) are traveling parallel to the optic axis before being 
focused by the second half of the prism into a crossover (or image) $I_x$ located a 
distance $v_x$ from the exit edge. We can regard these particular values of u and $v_x$ 
as being the focal lengths $f_x$ of the first and second halves of the prism, and their 
reciprocals as the corresponding focusing powers. In the SCOFF approximation, 
these focusing powers are given by (Wittry, 1969) 


$$1/f_x = [\tan(\phi/2) - \tan\varepsilon]/R \tag{2.2}$$


where $\varepsilon = \varepsilon_1$ for the first half of the prism and $\varepsilon = \varepsilon_2$ for the second half, $\varepsilon_1$ and 
$\varepsilon_2$ being the tilt angles of the prism edges. Note that a positive value of $\varepsilon$ reduces 
the radial focusing power, leading to longer object and image distances. In this case, 
the boundaries of the magnet have a divergent focusing action, whereas the effect 
of the uniform field in the center of the magnet is to gradually return the electrons 
toward the optic axis, as illustrated in Fig. 2.7. 

In contrast, focusing in the axial (y-) direction takes place only at the entrance 
and exit of the magnet. Each boundary can be characterized by a focusing power 
that is given, in the SCOFF approximation, by 


$$1/f_y = \tan(\varepsilon)/R \tag{2.3}$$


Because $1/f_x$ and $1/f_y$ change in opposite directions as $\varepsilon$ is varied, the entrance- 
and exit-face tilts can be chosen so that the net focusing powers in the radial and 
axial directions are equal; the prism is then double focusing. Although not essential, 
an approximation to double focusing is generally desirable because it minimizes 
the width (in the y-direction) of the image at the energy-selecting plane, making 
the energy resolution of the system less dependent on the precise orientation of the 
detector about the z-axis. 

Fig. 2.7 Trajectories of electrons through a magnetic prism spectrometer. Solid lines represent the 
component of motion in the x-z plane (first principal section); dashed lines represent motion in the 
y-z plane (second principal section). The horizontal axis indicates distance along the optic axis, 
relative to the center of the prism. For design E, $\varepsilon_1$ = 0 and $\varepsilon_2$ = 45°; for design F, $\varepsilon_1$ = 45° and 
$\varepsilon_2$ = 10°. From Egerton (1980b), copyright Elsevier 
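
The opposing trends of Eqs. (2.2) and (2.3) can be seen numerically. The sketch below evaluates both SCOFF focusing powers for one half of the prism; the function names and the choice R = 0.1 m are assumptions made for illustration.

```python
import math


def radial_power(phi_deg, eps_deg, radius_m):
    """Eq. (2.2): radial focusing power of one half of the prism, in 1/m."""
    half_bend = math.radians(phi_deg) / 2.0
    return (math.tan(half_bend) - math.tan(math.radians(eps_deg))) / radius_m


def axial_power(eps_deg, radius_m):
    """Eq. (2.3): axial focusing power of one tilted boundary, in 1/m."""
    return math.tan(math.radians(eps_deg)) / radius_m


if __name__ == "__main__":
    radius = 0.1  # assumed bend radius, m
    for eps in (0.0, 15.0, 26.565, 40.0):
        print(f"eps = {eps:6.3f} deg: 1/f_x = {radial_power(90.0, eps, radius):6.3f}, "
              f"1/f_y = {axial_power(eps, radius):6.3f}")
```

For a 90° bend the two powers cross where the tangent of the edge angle is 0.5 (an edge angle near 26.6°), giving a focal length of 2R in both directions. Increasing the edge angle beyond that strengthens the axial focusing at the expense of the radial focusing.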

For a bend angle of 90°, the most symmetrical solution of Eqs. (2.2) and (2.3) 
corresponds to the double-focusing condition: $u = v_x = v_y = 2R$ and $\tan\varepsilon_1 = 
\tan\varepsilon_2 = 0.5$ (i.e., $\varepsilon$ = 26.6°). In practice, the object distance u may be dictated by 
external constraints, such as the location of the projector lens crossover in an 
electron microscope column. The spectrometer will still be double focusing provided 
the prism angles $\varepsilon_1$ and $\varepsilon_2$ satisfy the relation (valid in the SCOFF approximation) 


$$\tan\varepsilon_2 = \frac{1}{2}\left[\frac{1 - (\tan\varepsilon_1 + R/u)\cot\phi}{\tan\varepsilon_1 + R/u + \cot\phi} - \frac{\tan\varepsilon_1 - R/u}{1 - \phi(\tan\varepsilon_1 - R/u)}\right] \tag{2.4}$$


As $\varepsilon_1$ increases, the required value of $\varepsilon_2$ decreases, as illustrated in Fig. 2.8. The 
image distance $v_x = v_y = v$ is given by 


$$\frac{v}{R} = \left[\frac{\tan\varepsilon_1 - R/u}{1 - \phi(\tan\varepsilon_1 - R/u)} + \tan\varepsilon_2\right]^{-1} \tag{2.5}$$


Fig. 2.8 Double-focusing parameters of a magnetic prism, for a fixed object distance (u = 2R) 
and bend angle $\phi$ = 90°. The curves were calculated using the SCOFF approximation; dashed 
lines indicate the region in which correction of second-order aberrations requires excessive edge 
curvatures, as determined by Eq. (2.18). One set of points is given for an extended fringing field 
(EFF) with $K_1$ = 0.4. From Egerton (1980b), copyright Elsevier 


A large difference between $\varepsilon_1$ and $\varepsilon_2$ leads to stronger focusing, reflected in a shorter 
image distance (Fig. 2.8). The dispersive power $D = dx/dE_0$ at the image plane is 
(Livingood, 1969) 


$$D = \frac{R}{2\gamma T}\,\frac{\sin\phi + (1 - \cos\phi)(\tan\varepsilon_1 + R/u)}{\sin\phi\,[1 - \tan\varepsilon_2(\tan\varepsilon_1 + R/u)] - \cos\phi\,(\tan\varepsilon_1 + \tan\varepsilon_2 + R/u)} \tag{2.6}$$


where $2\gamma T = E_0(2m_0c^2 + E_0)/(m_0c^2 + E_0)$, $E_0$ represents the kinetic energy of 
electrons entering the spectrometer, and $m_0c^2$ = 511 keV is the electron rest energy. 
If the spectrometer is to be reasonably compact, the value of R cannot exceed 
10-20 cm and D is limited to a few micrometers per electron volt for $E_0$ = 100 keV. 
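
The first-order design relations can be combined into a short calculation. The sketch below assumes the SCOFF forms of Eqs. (2.4) to (2.6) as obtained from first-order transfer matrices (edge lenses plus a uniform-field sector), with the bend angle in radians. The function names and the example design (90° bend, u = 2R, R = 10 cm, 100 keV) are illustrative assumptions, not a published design.

```python
import math

REST_ENERGY_EV = 510998.95  # electron rest energy m0*c^2, eV


def double_focusing_design(phi_rad, tan_eps1, r_over_u):
    """Return (tan_eps2, v/R) for a double-focusing prism, Eqs. (2.4)-(2.5)."""
    p = tan_eps1 + r_over_u
    q = tan_eps1 - r_over_u
    cot_phi = math.cos(phi_rad) / math.sin(phi_rad)
    radial_term = (1.0 - p * cot_phi) / (p + cot_phi)
    axial_term = q / (1.0 - phi_rad * q)
    tan_eps2 = 0.5 * (radial_term - axial_term)
    v_over_r = 1.0 / (tan_eps2 + axial_term)
    return tan_eps2, v_over_r


def dispersive_power(radius_m, phi_rad, tan_eps1, tan_eps2, r_over_u, e0_ev):
    """Eq. (2.6): dispersion D = dx/dE0 at the image plane, in m/eV."""
    two_gamma_t = e0_ev * (2.0 * REST_ENERGY_EV + e0_ev) / (REST_ENERGY_EV + e0_ev)
    p = tan_eps1 + r_over_u
    numerator = math.sin(phi_rad) + (1.0 - math.cos(phi_rad)) * p
    denominator = (math.sin(phi_rad) * (1.0 - tan_eps2 * p)
                   - math.cos(phi_rad) * (p + tan_eps2))
    return (radius_m / two_gamma_t) * numerator / denominator


if __name__ == "__main__":
    phi = math.pi / 2.0
    tan_eps2, v_over_r = double_focusing_design(phi, 0.5, 0.5)
    d = dispersive_power(0.10, phi, 0.5, tan_eps2, 0.5, 1.0e5)
    print(f"eps2 = {math.degrees(math.atan(tan_eps2)):.1f} deg, "
          f"v = {v_over_r:.2f} R, D = {d * 1e6:.2f} um/eV")
```

For the symmetric case this returns an exit-edge angle near 26.6°, an image distance of 2R, and a dispersion of about 2 μm/eV, matching the "few micrometers per electron volt" quoted for a compact 100-keV spectrometer. Other entrance-edge angles can be tried to see the required exit-edge angle and image distance change together.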


2.2.1.1 The Effect of Fringing Fields 


The SCOFF approximation is convenient for discussing the general properties of a 
magnetic prism and is useful in the initial stages of spectrometer design, but does 
not provide accurate predictions of the focusing. The effects of a spatially extended 
fringing field have been described by Enge (1964) as follows. 

First of all, the exit beam is displaced in the radial direction compared to the 
SCOFF trajectory. This effect can be taken into account by shifting the magnet 
slightly in the +x-direction or by increasing the magnetic field by a small amount. 

Second, the focusing power in the axial (y-) direction is decreased, whereas the 
radial (x-) focusing remains practically unaltered. As a result, either $\varepsilon_1$ or $\varepsilon_2$ must be 
increased (compared to the SCOFF prediction) in order to maintain double focusing. 
The net result is a slight increase in image distance; see Fig. 2.8. 

A third effect of the extended fringing fields is to add a convex component of 
curvature to the entrance and exit edges of the magnet, the magnitude of this com- 
ponent varying inversely with the polepiece width w. Such curvature affects the 
spectrometer aberrations, as discussed in Section 2.2.2. Finally, the extended fring- 
ing field introduces a discrepancy between the “effective” edge of the magnet (which 
serves as a reference point for measuring object and image distances) and the actual 
“mechanical” edge, the former generally lying outside the latter. 

To define the spatial extent of the fringing field, so that it can be properly taken 
into account in EFF calculations and is less affected by the surroundings of the 
spectrometer, plates made of a soft magnetic material (“mirror planes”) are sometimes 
placed parallel to the entrance and exit edges, to “clamp” the field to a low 
value at the required distance from the edge. If the plate-polepiece separation is 
chosen as g/2, where g is the length of the polepiece gap in the y-direction, and the 
polepiece edges are beveled at 45° to a depth g/2 (see Fig. 2.9), the magnetic field 
decays almost linearly over a distance g along the optic axis. More importantly, 
the position, angle, and curvature of the magnetic field boundary more nearly coin- 
cide with those of the polepiece edge. However, the correspondence is not likely to 
be exact, partly because the fringing field penetrates to some extent into the holes 
that must be provided in the mirror plates to allow the electron beam to enter and 
leave the spectrometer (Fig. 2.9). The remaining discrepancy between the effective 
and mechanical edge depends on the polepiece gap, on the separation of the field 
clamps from the magnet, and on the radius of curvature of the edges (Heighway, 
1975). 

Fig. 2.9 Cross sections (in the x-z and x-y planes) through an aberration-corrected spectrometer 
with curved and tapered polepiece edges, soft-magnetic mirror plates, and window-frame excitation 
coils. The magnet operates in air, the vacuum being confined within a nonmagnetic “drift” 
tube. From Egerton (1980b), copyright Elsevier 

The effect of the fringing field on spectrometer focusing can be specified in terms 
of the gap length g and a shape parameter K, defined by 


$$K_1 = \int_{-\infty}^{\infty} \frac{b_y(z')\,[B - b_y(z')]}{g B^2}\, dz' \tag{2.7}$$

where b_y(z′) is the y-component of induction at y = 0 and at a perpendicular distance 
z′ from the polepiece edge; B is the induction between the polepieces within the 
interior of the spectrometer. The SCOFF approximation corresponds to K₁ = 0; the 
use of tapered polepiece edges and mirror plates, as specified above, gives K₁ ≈ 
0.4. If the fringing field is not clamped by mirror plates, the value of K₁ is higher: 
approximately 0.5 for a square-edged magnet and 0.7 for tapered polepiece edges 
(Brown et al., 1977). If the polepiece gap is large, a second coefficient K₂ may be 
necessary to properly describe the effect of the fringing field; however, its effect is 
small for g/R < 0.3 (Heighway, 1975). 


2.2.1.2 Matrix Notation 


Particularly when fringing fields are taken into account, the equations needed to 
describe the focusing properties of a magnetic prism become quite complicated. 
Their form can be simplified and the method of calculation made more systematic 
by using a matrix notation, as in the design of light-optical systems. The optical path 
between object and image is divided into sections and a transfer matrix written down 
for each section. The first stage of the electron trajectory corresponds to drift in a 
straight line through the field-free region between the object plane and the entrance 
edge of the magnet. The displacement coordinates (x, y, z) of an electron change, but 
not its angular coordinates (x’ = dx/dz, y’ = dy/dz). Upon arrival at the entrance 
edge of the magnet, these four coordinates are therefore given by the following 
matrix equation: 


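$$\begin{pmatrix} x \\ x' \\ y \\ y' \end{pmatrix} = \begin{pmatrix} 1 & u & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & u \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x_0 \\ x_0' \\ y_0 \\ y_0' \end{pmatrix} \tag{2.8}$$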


Here, x₀ and y₀ are the components of electron displacement at the object plane, x₀′ 
and y₀′ are the corresponding angular components, and the 4 × 4 square matrix is 
the transfer matrix for drift over a distance u (measured along the optic axis). 

The electron then encounters the focusing action of the tilted edge of the magnet. 
In the SCOFF approximation, the focusing powers are 1/f_x = −(tan ε)/R and 
1/f_y = (tan ε)/R. The focusing being of equal magnitude but opposite sign in the 
x- and y-directions, the magnet edge is equivalent to a quadrupole lens. Allowing 
for extended fringing fields, the corresponding transfer matrix is (Brown, 1967) 


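$$\begin{pmatrix} 1 & 0 & 0 & 0 \\ R^{-1}\tan\varepsilon_1 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & -R^{-1}\tan(\varepsilon_1 - \psi_1) & 1 \end{pmatrix} \tag{2.9}$$

where ψ₁ represents a correction for the extended fringing field, given by 

$$\psi_1 \approx (g/R)\,K_1\,(1 + \sin^2\varepsilon_1)/\cos\varepsilon_1 \tag{2.10}$$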


The third part of the trajectory involves bending of the beam within the interior of 
the magnet. As discussed in Section 2.1.1, the uniform magnetic field has a positive 
(convex) focusing action in the x-direction but no focusing action in the y-direction. 
The effect is equivalent to that of a dipole field, as produced by a sector magnet 
with ε₁ = ε₂ = 0. If φ is the bend angle, the corresponding transfer matrix can be 
written in the form (Penner, 1961) 


$$\begin{pmatrix} \cos\phi & R\sin\phi & 0 & 0 \\ -R^{-1}\sin\phi & \cos\phi & 0 & 0 \\ 0 & 0 & 1 & R\phi \\ 0 & 0 & 0 & 1 \end{pmatrix} \tag{2.11}$$


Upon arrival at the exit edge of the prism, the electron again encounters an effective 
quadrupole, whose transfer matrix is specified by Eqs. (2.9) and (2.10) but with ε₂ 
substituted for ε₁. Finally, after leaving the prism, the electron drifts to the image 
plane, its transfer matrix being identical to that in Eq. (2.8) but with the object 
distance u replaced by the image distance v. 
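
The chain of five elements just described can be sketched numerically. The following is a minimal illustration, not the TRANSPORT implementation: it assumes the edge element has x- and y-terms +tan ε/R and −tan(ε − ψ)/R, and that the y-block of the uniform-field sector is a drift of length Rφ. The function names and the choice R = 0.1 m are hypothetical.

```python
import math

def matmul(a, b):
    """Multiply two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def drift(length):
    """Field-free drift over a distance `length`, as in Eq. (2.8)."""
    return [[1, length, 0, 0],
            [0, 1, 0, 0],
            [0, 0, 1, length],
            [0, 0, 0, 1]]

def edge(eps, radius, gap=0.0, k1=0.0):
    """Tilted magnet edge with fringing-field correction psi, Eqs. (2.9)-(2.10)."""
    psi = (gap / radius) * k1 * (1 + math.sin(eps) ** 2) / math.cos(eps)
    return [[1, 0, 0, 0],
            [math.tan(eps) / radius, 1, 0, 0],
            [0, 0, 1, 0],
            [0, 0, -math.tan(eps - psi) / radius, 1]]

def bend(phi, radius):
    """Uniform-field sector of bend angle phi, Eq. (2.11)."""
    c, s = math.cos(phi), math.sin(phi)
    return [[c, radius * s, 0, 0],
            [-s / radius, c, 0, 0],
            [0, 0, 1, radius * phi],
            [0, 0, 0, 1]]

def prism_matrix(u, eps1, phi, eps2, v, radius, gap=0.0, k1=0.0):
    """Object-to-image 4 x 4 matrix; each later element multiplies from the left."""
    m = drift(u)
    for element in (edge(eps1, radius, gap, k1), bend(phi, radius),
                    edge(eps2, radius, gap, k1), drift(v)):
        m = matmul(element, m)
    return m

# Symmetric SCOFF case: u = v = 2R, phi = 90 deg, tan(eps) = 0.5 (eps ~ 26.6 deg).
R = 0.1  # bend radius in metres (arbitrary illustrative value)
eps = math.atan(0.5)
M = prism_matrix(2 * R, eps, math.pi / 2, eps, 2 * R, R)
print("R11 = %.4f  R12 = %.2e" % (M[0][0], M[0][1]))
print("R33 = %.4f  R34 = %.2e" % (M[2][2], M[2][3]))
```

For this symmetric geometry the off-diagonal terms R12 and R34 vanish to rounding error, indicating double focusing with unit magnification and an inverted image. Setting nonzero `gap` and `k1` shows how the fringing field weakens the y-focusing and leaves a residual R34.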

Following the rules of matrix manipulation, the five transfer matrices are multiplied 
together to yield a 4 × 4 transfer matrix that relates the electron coordinates 
and angles at the image plane (xᵢ, yᵢ, xᵢ′, and yᵢ′) to those at the object plane. However, 
the first-order properties of a magnetic prism can be specified more completely by 
introducing two additional parameters. One of these is the total distance or path 
length l traversed by an electron, which is of interest in connection with time-of-flight 
measurements but not relevant to dispersive operation of a spectrometer. The 
other additional parameter is the fractional momentum deviation δ of the electron, 
relative to that required for travel along the optic axis (corresponding to a kinetic 
energy E₀ and zero energy loss). This last parameter is related to the energy loss 
E by 


$$\delta = -E/(2\gamma T) \tag{2.12}$$


where 2γT = E₀(2m₀c² + E₀)/(m₀c² + E₀). The first-order properties of the prism 
are then represented by the equation 
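$$\begin{pmatrix} x_i \\ x_i' \\ y_i \\ y_i' \\ l_i \\ \delta_i \end{pmatrix} = \begin{pmatrix} R_{11} & R_{12} & 0 & 0 & 0 & R_{16} \\ R_{21} & R_{22} & 0 & 0 & 0 & R_{26} \\ 0 & 0 & R_{33} & R_{34} & 0 & 0 \\ 0 & 0 & R_{43} & R_{44} & 0 & 0 \\ R_{51} & R_{52} & 0 & 0 & 1 & R_{56} \\ 0 & 0 & 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x_0 \\ x_0' \\ y_0 \\ y_0' \\ l_0 \\ \delta_0 \end{pmatrix} \tag{2.13}$$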


Many of the elements in this 6 × 6 matrix are zero as a result of the mirror symmetry 
of the spectrometer about the x-z plane. Of the remaining coefficients, R11 and R33 
describe the lateral image magnifications (M_x and M_y) in the x- and y-directions. In 
general, R11 ≠ R33, so the image produced by the prism suffers from rectangular 
distortion. For a real image, R11 and R33 are negative, denoting the fact that the 
image is inverted about the optic axis. R22 and R44 are the angular magnifications, 
approximately equal to the reciprocals of R11 and R33, respectively. 

Provided the spectrometer is double focusing and the value of the final drift 
length used in calculating the R-matrix corresponds to the image distance, R12 
and R34 are both zero. If the spectrometer is not double focusing, R12 = 0 at the 
x-focus and the magnitude of R34 gives an indication of the length of the line focus 
in the y-direction. To obtain good energy resolution from the spectrometer, R12 
should be zero at the energy-selection plane and R34 should preferably be small. 
The other matrix coefficient of interest in connection with energy-loss spectroscopy 
is R16 = ∂xᵢ/∂(δ₀), which relates to the energy dispersion of the spectrometer. Using 
Eq. (2.12), the dispersive power D = −∂xᵢ/∂E is given by 


$$D = R_{16}/(2\gamma T) \tag{2.14}$$
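
As a rough numerical check of the last two relations, the sketch below evaluates 2γT and the dispersive power for a given R16. The value R16 = 0.4 m is a hypothetical illustration, not a figure from any design discussed here, and the function names are our own.

```python
M0C2_EV = 510998.95  # electron rest energy in eV

def two_gamma_t(e0_ev):
    """Return 2*gamma*T in eV for an incident kinetic energy e0_ev (in eV)."""
    return e0_ev * (2 * M0C2_EV + e0_ev) / (M0C2_EV + e0_ev)

def momentum_deviation(energy_loss_ev, e0_ev):
    """Fractional momentum deviation delta for a given energy loss, Eq. (2.12)."""
    return -energy_loss_ev / two_gamma_t(e0_ev)

def dispersive_power(r16_m, e0_ev):
    """Dispersion D = R16 / (2*gamma*T), Eq. (2.14), in metres per eV."""
    return r16_m / two_gamma_t(e0_ev)

E0 = 100e3   # 100 keV incident energy
R16 = 0.4    # hypothetical matrix element, in metres
print("2*gamma*T = %.1f keV" % (two_gamma_t(E0) / 1e3))
print("delta for 1 eV loss = %.2e" % momentum_deviation(1.0, E0))
print("D = %.2f um/eV" % (dispersive_power(R16, E0) * 1e6))
```

With these inputs 2γT is close to 184 keV and D comes out at roughly 2 μm/eV, in line with the statement that a compact spectrometer offers only a few micrometers per electron volt at 100 keV.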


The R-matrix of Eq. (2.13) can be evaluated by multiplication of the individual 
transfer matrices, provided the values of u, ε₁, φ, ε₂, K₁, g, and v are specified. 
Such tedious arithmetic is best done by computer, for example, by running the 
TRANSPORT program (Brown et al., 1977). This program¹ also computes second- 
and third-order focusing, the effects of other elements (e.g., quadrupole or sextupole 
lenses), of a magnetic field gradient or inhomogeneity, and of stray magnetic fields. 


2.2.2 Higher Order Focusing 


The matrix notation is well suited to the discussion and calculation of second-order 
properties of a magnetic prism. Using the same six coordinates (x, x′, y, 
y′, l, and δ), second derivatives in the form (for example) ∂²xᵢ/∂x₀∂x₀′ can be 
defined and arranged in the form of a 6 × 6 × 6 T-matrix, analogous to the first-order 
R-matrix. Many of the 6³ = 216 second-order T-coefficients are zero or 
are related to one another by midplane symmetry of the magnet. For energy-loss 
spectroscopy, where the beam diameter at the object plane (i.e., the source size) 
is small and where image distortions and off-axis astigmatism are of little significance, 
the most important second-order matrix elements are T122 = ∂²xᵢ/∂(x₀′)² and 
T144 = ∂²xᵢ/∂(y₀′)². These coefficients represent second-order aperture aberrations 
that increase the image width in the x-direction and therefore degrade the energy 
resolution, particularly in the case of a large spread of incident angles (x₀′ and y₀′). 

¹ Available from http://www.slac.stanford.edu/pubs/slacreports/slac-r-530.html 

Whereas the first-order focusing of a magnet boundary depends on its effective 
quadrupole strength (equal to −tan ε = −dz/dx in the SCOFF approximation), 
the second-order aperture aberration depends on the effective sextupole strength: 
−(2ρ cos³ ε)⁻¹ in the SCOFF approximation (Tang, 1982a). The aberration coefficients 
can therefore be varied by adjusting the angle ε and curvature ρ = ∂²z/∂x² of 
the boundary. Convex boundaries can only correct second-order aberration for electrons 
traveling in the x-z plane (T122 = 0), but if one boundary is made concave, 
the aberration for electrons traveling out of the radial plane can also be corrected 
(T122 = T144 = 0). Alternatively, the correction can be carried out by means of 
magnetic or electrostatic sextupole lenses placed before and after the spectrometer 
(Parker et al., 1978). 

A second-order property that is of particular importance is the angle ψ between 
the dispersion plane (the plane of best chromatic focus for a point object) and the 
x-axis adjacent to the image; see Fig. 2.10. This tilt angle is related to the matrix 
element T126 = ∂²xᵢ/∂x₀′∂(δ) by² 


$$\tan\psi = -T_{126}/(R_{22}R_{16}) \tag{2.15}$$


The condition ψ = 0 is desirable if lenses follow the spectrometer or if a parallel-recording 
detector is oriented perpendicular to the exit beam. More generally, 
adjustment of ψ allows control over the chromatic aberration of the whole system, 
external lenses included. Another second-order coefficient of some relevance is 
T166, which does not affect the energy resolution but specifies nonlinearity of the 
energy-loss axis. 

The matrix method has been extended to third-order derivatives, including 
the effect of extended fringing fields (Matsuda and Wollnik, 1970; Matsuo and 
Matsuda, 1971; Tang, 1982a, b). Many of the 1296 third-order coefficients are zero 
as a result of the midplane symmetry, and only a limited number of the remaining 
ones are of interest for energy-loss spectroscopy. The coefficients ∂³xᵢ/∂(x₀′)³ and 
∂³xᵢ/∂x₀′∂(y₀′)² represent aperture aberrations and (like T122 and T144) may have 
the same or opposite signs (Scheinfein and Isaacson, 1984). Tang (1982a) pointed 
out that correction of second-order aberrations by curving the entrance and exit 
edges of the magnet can increase these third-order coefficients, so that the latter 
limit the energy resolution for entrance angles γ above 10 mrad. The chromatic 
term ∂³xᵢ/∂(δ)³ causes additional nonlinearity of the energy-loss axis but is 
likely to be important only for energy losses of several kiloelectron volts. The 
coefficients ∂³xᵢ/∂(x₀′)²∂(δ), ∂³xᵢ/∂(y₀′)²∂(δ), and ∂³xᵢ/∂(x₀′)∂(δ)² introduce tilt 
and curvature of the dispersion plane, which may degrade the energy resolution 
when a parallel-recording system is used. 

² If TRANSPORT is used to calculate the matrix elements, a multiplying factor of 1000 is required 
on the right-hand side of Eq. (2.15) as a result of the units (x in cm, x′ in mrad, and δ in %) used in 
that program. 

Fig. 2.10 Electron optics of a double-focusing spectrometer with curved polefaces. The y-axis and 
the applied magnetic field are perpendicular to the plane of the diagram. The polepiece-tilt angles 
(ε₁ and ε₂) refer to the central trajectory (the optic axis). Exit trajectories of energy-loss electrons 
are shown by dashed lines. From Egerton (1980b), copyright Elsevier 

If third-order aberrations are successfully corrected, for example, by the use 
of octupole lenses outside the spectrometer (Tang, 1982b; Krivanek et al., 2008), 
the energy resolution is limited by the fourth-order aberrations: ∂⁴xᵢ/∂(x₀′)⁴, 
∂⁴xᵢ/∂(y₀′)⁴, and ∂⁴xᵢ/∂(x₀′)²∂(y₀′)². Fourth-order matrix theory has not been developed 
but ray-tracing programs can be used to predict the focusing of electrons. They 
work by evaluating the rate of change of momentum as (−e)(v × B) and using this 
information to define an electron path, initially the optic axis. A trajectory originating 
from the center of the object plane but at a small angle relative to the optic axis 
is then evaluated; the point where this ray arrives back at the optic axis defines the 
Gaussian image plane. The positions on this plane of electrons with increasing angular 
deviation define an aberration figure (or spot diagram), from which aberration coefficients 
can be estimated. One program that implements ray tracing is SIMION (http://
simion.com), which runs on Windows and Linux computers. 
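
The principle of such ray tracing can be illustrated with a few lines of code. The sketch below is not related to SIMION and handles only the simplest case, a uniform field B along +y with no fringing field. It integrates the force (−e)(v × B) with a fourth-order Runge-Kutta step and compares the traced path with the expected circular orbit of radius p/(eB). The function name and step count are arbitrary choices.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, C
M_E = 9.1093837015e-31       # electron rest mass, kg
C_LIGHT = 299792458.0        # speed of light, m/s

def trace_quarter_turn(e0_ev, b_tesla, steps=2000):
    """Trace an electron through a 90-degree bend in a uniform field.

    The field points along +y; the electron starts at the origin moving
    along +z. Returns (x, z, radius), where radius = p/(eB).
    """
    gamma = 1.0 + e0_ev * E_CHARGE / (M_E * C_LIGHT ** 2)
    speed = C_LIGHT * math.sqrt(1.0 - 1.0 / gamma ** 2)
    omega = E_CHARGE * b_tesla / (gamma * M_E)   # orbital angular frequency
    radius = speed / omega                       # equals p/(eB)
    dt = (math.pi / 2) / omega / steps

    def deriv(s):
        x, z, vx, vz = s
        # (-e)(v x B)/(gamma*m) with B = (0, B, 0): ax = +omega*vz, az = -omega*vx
        return (vx, vz, omega * vz, -omega * vx)

    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))

    state = (0.0, 0.0, 0.0, speed)
    for _ in range(steps):
        k1 = deriv(state)
        k2 = deriv(shifted(state, k1, dt / 2))
        k3 = deriv(shifted(state, k2, dt / 2))
        k4 = deriv(shifted(state, k3, dt))
        state = tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                      for s, a, b, c, d in zip(state, k1, k2, k3, k4))
    return state[0], state[1], radius

x, z, radius = trace_quarter_turn(100e3, 0.01)
print("bend radius = %.4f m, exit point = (%.4f, %.4f) m" % (radius, x, z))
```

For 100-keV electrons in a 0.01-T field the bend radius is about 11 cm, and after a quarter turn the traced electron sits at x = z = R, as expected for a circular orbit. A realistic calculation would replace the uniform field by a field map that includes the fringing regions and would launch a fan of rays to build up the spot diagram.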

Spectrometer aberrations are particularly important in the case of core-loss spec- 
troscopy involving higher energy ionization edges, where the angular range of the 
inelastic scattering can extend to tens of milliradians and where high collection 
efficiency is desirable to obtain adequate signal. 


2.2.3 Spectrometer Designs 


To illustrate the above concepts, we outline a procedure for designing a double-focusing 
spectrometer with aperture aberrations corrected to second order by 
curving the entrance and exit edges. First of all, the prism angles are chosen so 
as to obtain suitable first-order focusing. As discussed below, the value of ε₁ should 
either be fairly large (close to 45°) or quite small (<10°, or even negative). Knowing 
the location of the spectrometer object point and the required bend radius R (which 
determines the energy dispersion D and the size and weight of the magnet), approximate 
values of ε₂ and v can be calculated using Eqs. (2.4) and (2.5). If either v or 
ε₂ turns out to be inconveniently large, a different value of ε₁ must be selected. 

These first-order parameters are then refined to take account of extended fringing 
fields, requiring a knowledge of the integral K₁ (which depends on the shape of the 
polepiece corners and on whether magnetic field clamps are to be used) and the 
polepiece gap g (typically 0.1R-0.2R). The TRANSPORT program uses a fitting 
procedure to find the exact image distance v corresponding to an x-focus (R12 = 0). 
The value of R34 will then be nonzero, indicating a line focus. If |R34| is excessive 
(>1 μm/mrad), either ε₁ or ε₂ is changed slightly to obtain a closer approach to 
double focusing. The dispersive power of the spectrometer is estimated from Eq. 
(2.6) or obtained more accurately using Eq. (2.14). 

The next stage is to determine values of the edge curvatures R₁ and R₂ that make 
the second-order aperture aberrations zero. This is most easily done by recognizing 
that T122 and T144 both vary linearly with the edge curvatures. In other words, 


$$-T_{122} = a_0 + a_1(R/R_1) + a_2(R/R_2) \tag{2.16}$$

$$-T_{144} = b_0 + b_1(R/R_1) + b_2(R/R_2) \tag{2.17}$$


where a₀, a₁, a₂, b₀, b₁, and b₂ are constants for a given first-order focusing. In 
general, a₀ is positive but a₁ and a₂ are negative; T122 can therefore be made zero 
with R₁ and R₂ both positive, implying convex entrance and exit edges. However, 
b₀, b₁, and b₂ are usually all positive so T144 = 0 requires that either R₁ or R₂ be 
negative, indicating a concave edge (Fig. 2.10). The required edge radii (R₁* and R₂*) 
are found empirically by using the matrix program to calculate T122 and T144 for 
three arbitrary pairs of R/R₁ and R/R₂, such as (0, 0), (0, 1), and (1, 0), generating 
six simultaneous equations that can be solved for a₀, a₁, a₂, b₀, b₁, and b₂. Then R₁* 
and R₂* are deduced by setting T122 and T144 to zero in Eqs. (2.16) and (2.17). 
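
The fitting procedure just described amounts to a small linear-algebra exercise, sketched below. The function `matrix_program` is only a stand-in for a TRANSPORT-style calculation, and its coefficients are invented for illustration, chosen to follow the sign pattern stated above. Here p and q denote R/R₁ and R/R₂.

```python
def fit_linear(t00, t01, t10):
    """Coefficients (c0, c1, c2) of -T = c0 + c1*p + c2*q, from T evaluated
    at (p, q) = (0, 0), (0, 1), and (1, 0)."""
    c0 = -t00
    c2 = -t01 - c0
    c1 = -t10 - c0
    return c0, c1, c2

def solve_curvatures(a, b):
    """Solve -T122 = 0 and -T144 = 0 for (p, q) by Cramer's rule."""
    a0, a1, a2 = a
    b0, b1, b2 = b
    det = a1 * b2 - a2 * b1
    if abs(det) < 1e-12:
        raise ValueError("equations are degenerate; no unique solution")
    p = (-a0 * b2 + a2 * b0) / det
    q = (-a1 * b0 + a0 * b1) / det
    return p, q

def matrix_program(p, q):
    """Hypothetical stand-in returning (T122, T144); coefficients are made up."""
    t122 = -(2.0 - 1.0 * p - 3.0 * q)   # a0 > 0, a1 < 0, a2 < 0
    t144 = -(1.0 + 1.0 * p + 2.0 * q)   # b0, b1, b2 all positive
    return t122, t144

trial_pairs = [(0, 0), (0, 1), (1, 0)]
results = [matrix_program(p, q) for p, q in trial_pairs]
a = fit_linear(*(r[0] for r in results))
b = fit_linear(*(r[1] for r in results))
p_star, q_star = solve_curvatures(a, b)
print("R/R1* = %.3f, R/R2* = %.3f" % (p_star, q_star))
print("residual T122, T144:", matrix_program(p_star, q_star))
```

With these made-up coefficients one of the two solutions is negative, which corresponds to a concave edge. The edge radii themselves follow as R₁* = R/p and R₂* = R/q.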

Not all spectrometer geometries yield reasonable values of R₁* and R₂*. For example, 
the completely symmetric case (u = v = 2R, ε₁ = ε₂ = 26.6° for φ = 90° in 
the SCOFF approximation) gives R₁* = R₂* = 0, corresponding to infinite curvature. 
As |ε₁ − ε₂| increases, the necessary edge curvatures must be kept reasonably low 
because the maximum effective width w* of the polepieces at the entrance or exit 
edge is given by 


$$w^* = 2|R^*|\,(1 - \sin|\varepsilon|) \tag{2.18}$$


for a concave edge and by w* = 2R* cos ε for a convex edge. In practice, the 
concave edge corresponds to the higher value of ε, so (for small R*) Eq. (2.18) 
imposes an upper limit on the angular range (x′) of electrons that can pass through 
the magnet. This limitation is not present if aberrations are adjusted by means of 
external sextupoles. With the addition of external octupoles, third-order aberrations 
can also be corrected (Krivanek et al., 2008). Such multipole devices are also used to 
correct the aberrations of axially symmetric imaging lenses; see Fig. 2.11. 

Fig. 2.11 (a) Magnetic sextupole and (b) multipole lens, used to correct the aberrations of a TEM 
lens or a spectrometer. Courtesy of Max Haider, CEOS GmbH 

In the above analysis, the object distance u and bend radius R were assumed to be 
fixed by the geometry of an electron microscope column and the space available for 
the spectrometer. If the ratio u/R can be varied, there is freedom to adjust a further 
second-order matrix element, such as T126. Parker et al. (1978) showed that there 
can be two values of image (or object) distance for which T126 = 0, giving zero tilt 
of the dispersion plane. 

Table 2.1 gives examples of aberration-corrected designs. The recent GIF 
Quantum spectrometer (Gubbens et al., 2010) uses a gradient field design to reduce 
the poleface angles (ε₁ and ε₂). The pole faces are not curved; aberration correction 
(including partial fourth and fifth orders) is achieved by means of three external 
dodecupole (12-pole) lenses, one before the prism and two between the prism and 
the energy-selecting slit. The poles of these lenses can be individually excited to 
generate any combination of dipole, quadrupole, sextupole, and higher order elements, 
in order to control the prism focusing up to sixth order. Because these 
different elements share the same optic axis, alignment is easier than with separate 
elements. A further five dodecupoles (after the slit) project a spectrum or an 
energy-filtered image onto the CCD detector. 


Table 2.1 Design parameters for aberration-corrected spectrometers 

φ (deg)  ε₁ (deg)  ε₂ (deg)  u/R    v/R    R₁*/R   R₂*/R    g/R     Reference 
60       14.64     18.07     ∞      1.46   1.351   −2.671   0.07    Fields (1977) 
90       0         45.0      1.45   2.16   0.807   −1.357   0.2     Egerton (1980b) 
70       11.75     28.79     3.60   2.38   0.707   −0.603   0.137   Shuman (1980) 
90       17.5      45.0      5      0.98   1.0     −0.496   0.125   Krivanek and Swann (1981) 
66.6     −15       45.8      2.25   2.06   2.34    −1.30    0.18    Tang (1982a) 
90       15.9      46.5      4.52   1.08   0.867   −0.500   0.19    Reichelt and Engel (1984) 
80       14.6      35.1      3.5    1.82   0.728   −0.576   0.25    Scheinfein and Isaacson (1984) 
90       16        47        6.2    0.9    1.0     −0.42    0.125   Krivanek et al. (1995) 
90       0         0         10.9   0.7    ∞       ∞        0.227   Gubbens et al. (2010) 


2.2.4 Practical Considerations 


The main aim when designing an electron spectrometer is to achieve good energy 
resolution even in the presence of a large spread γ of entrance angles, enabling 
the spectrometer system to have a high collection efficiency (see Section 2.3). For 
γ = 10 mrad, correction of second-order aberrations allows a resolution of about 1 eV for 
energy losses up to 1 keV (Krivanek and Swann, 1981; Colliex, 1982; Scheinfein 
and Isaacson, 1984). The value of γ is limited by the internal diameter of the “drift” 
tube (Fig. 2.9), which is necessarily less than the magnet gap g, so the historical 
trend has been toward relatively large g/R ratios (see Table 2.1), even though this 
makes accurate calculation of the fringing-field properties more difficult (Heighway, 
1975). The use of multipole elements, giving partial correction up to fifth order, can 
provide a resolution below 0.1 eV (Gubbens et al., 2010). 

The energy range falling on the detector depends on the bend radius R and the 
dispersion D, which increases with decreasing beam energy E₀. In the standard (SR) 
version of the GIF Quantum spectrometer, the bend radius R has been reduced from 
100 to 75 mm, allowing a 2-keV range for 200-keV electrons or 682 eV at 60 keV. An 
even smaller value (50 mm) is scheduled for lower-voltage TEMs (15-60 keV) and 
a larger one (200 mm) for high-voltage operation (400-1250 keV). 

Spectrometer designs such as those in Table 2.1 assume that the magnetic induc- 
tion B within the magnet is uniform or (in the gradient field case) varies linearly with 
distance x from the optic axis. More generally, the induction might vary according to 


$$B(x) = B(0)\,[1 - n(x/R) - m(x/R)^2 + \cdots] \tag{2.19}$$


where the coefficients n,m, ... introduce multipole components in the focusing. In 
a gradient field spectrometer the value of n depends on the angle between the pole- 
faces, which controls first-order focusing in the x- and y-directions, as an alternative 
to tilting the entrance and exit faces. Likewise, a sextupole component (dependent 
on m) could be deliberately added to control second-order aberrations (Crewe and 
Scaduto, 1982). However, the focusing properties are quite sensitive to the values 
of n and m; matrix calculations suggest that changing m by two parts in 10~° will 
degrade the energy resolution by 1 eV, for γ = 10 mrad (Egerton, 1980b). Therefore 
unintended variations must be avoided if a spectrometer is to behave as designed. 
A “C-core” magnet (where the magnetic field is generated by a coil connected 
by side arms to the polepieces) does not provide the required degree of uniformity, 
whereas a more symmetrical arrangement with window frame coils placed 
on either side of the gap (Fig. 2.9) can give a sufficiently uniform field, particularly 
if the separation between the planes of the two coils is carefully adjusted (Tang, 
1982a). 

Further requirements for field uniformity are that the magnetic material is sufficiently 
homogeneous and adequately thick. Homogeneity is achieved by annealing 
the magnet after machining and by choosing a material with high relative permeability 
μ and low coercivity at low field strength, such as mu-metal. Its minimum 
thickness t can be estimated by requiring the magnetic reluctance (∝ w/(tμ)) of each 
polepiece (in the x-direction) to be much less than the reluctance of the gap (∝ g/w), 
giving 


$$t \gg w^2/(\mu g) \tag{2.20}$$


Equation (2.20) precludes the use of thin magnetic sheeting, which would otherwise 
be attractive in terms of reduced weight of the spectrometer. 

It might appear that B-uniformity of 2 x 10~!° would require the polepiece gap to 
be uniform to within 2 μm over an x-displacement of 1 cm, for g = 1 cm. However, 
the allowable variation in field strength is that averaged over the whole electron 
trajectory, variations in the z-direction having less effect on the focusing. Also, provided 
they are small, the x- and x²-terms in Eq. (2.19) can be corrected by external 
quadrupole and sextupole coils. 

A substantial loss of energy resolution can occur if stray magnetic fields penetrate 
into the spectrometer. Field penetration can be reduced by enclosing the magnet and 
(more importantly) the entrance and exit drift spaces in a soft magnetic material such 
as mu-metal. Such screening is usually not completely effective but the influence of 
a remaining alternating field can be canceled by injecting a small alternating current 
into the spectrometer scan coils, as described in Section 2.2.5. 

The magnetic induction within the spectrometer is quite low (<0.01 T for 
100 keV operation) and can be provided by window frame coils of about 100 turns 
carrying a current of the order of 1 A. To prevent drift of the spectrum due to changes 
in temperature and resistance of the windings, the power supply must be current 
stabilized to within one part in 10⁶ for 0.3-eV stability at 100-keV incident energy. 
Stability is improved if the power supply is left running continuously. 
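
The stability figure quoted above can be checked with a short calculation. The sketch assumes that the induction is proportional to the coil current (no saturation) and that the induction needed to keep an electron on the optic axis is proportional to its momentum, so that the fractional current change equals the fractional momentum deviation E/(2γT). The function name is hypothetical.

```python
M0C2_EV = 510998.95  # electron rest energy in eV

def required_current_stability(energy_drift_ev, e0_ev):
    """Fractional current stability dI/I that keeps spectrum drift within
    energy_drift_ev, assuming dI/I = dB/B = energy_drift / (2*gamma*T)."""
    two_gamma_t = e0_ev * (2 * M0C2_EV + e0_ev) / (M0C2_EV + e0_ev)
    return energy_drift_ev / two_gamma_t

stability = required_current_stability(0.3, 100e3)
print("dI/I = %.1e" % stability)
```

For 0.3 eV at 100 keV this gives a fractional stability between one and two parts per million, consistent with the requirement stated in the text.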


2.2.5 Spectrometer Alignment 


Like all electron-optical elements, the magnetic prism performs to its design spec- 
ifications only if it is correctly aligned relative to the incoming beam of electrons. 
Since the energy dispersion is small for high-energy electrons, this alignment is 
fairly critical if the optimum energy resolution is to be achieved. 
2.2.5.1 Initial Alignment 


When a spectrometer is installed for the first time, or if the alignment of the spec- 
trometer or the microscope column has been disturbed, the electron beam may travel 
in a path that is far from the optic axis of the spectrometer (defined by the prism 
orientation and the value of the magnetic induction B). In this situation, a rough 
alignment of the system can be carried out in much the same way as alignment of 
an electron microscope column. Beam-limiting apertures, such as the spectrometer 
entrance aperture, are withdrawn and the entrance beam broadened, for example, by 
defocusing the illumination at the specimen plane. It may also be useful to sweep 
the magnetic field periodically by applying a fast ramp to the spectrometer scan 
coils, to deflect the exit beam over a range of several millimeters in the x-direction. 
Use of a two-dimensional detector, such as a phosphor screen and CCD camera, 
allows the exit beam to be located in both the x- and y-directions, especially if the 
energy-selecting slit is withdrawn. By alternately focusing and defocusing the pre- 
spectrometer lenses, it is possible to discover if the electron beam is passing through 
the center of the drift tube or is cut off asymmetrically by the tube walls or fixed 
apertures. To ensure that the beam travels close to the mechanical axis of the spec- 
trometer, it may be desirable to shift or tilt the magnet so that the positions (on the 
phosphor screen) where the exit beam is cut off are symmetric with respect to the 
center of the detector. 


2.2.5.2 Aberration Figure 


For optimum performance from the spectrometer, the beam must travel close to the 
magnetic axis of the prism and the spectrometer focusing must be correct. The desir- 
able conditions can be recognized from the shape of the beam at the detector plane. 
The first-order focusing is correct when the exit beam at the detector plane has min- 
imum width in the direction of dispersion. This condition is normally adjusted by 
means of quadrupole elements placed before or after the spectrometer. The focus- 
ing can be set more accurately if the depth of focus is made small, by using a large 
spectrometer entrance aperture and adjusting the TEM lenses so that the circle of 
illumination is large enough at that plane. 

If there were no aberrations and if the spectrometer were exactly double focusing, 
the exit beam would appear as a point or circle of very small diameter at the detec- 
tor plane. Spectrometer aberrations spread the beam into an aberration figure that 
can be observed directly on a fluorescent screen if the entrance divergence and the 
aberration coefficients are large enough. Second-order aberrations produce a figure 
whose shape (Fig. 2.12) can be deduced from the equations 


$$x_i = T_{122}(x_0')^2 + T_{144}(y_0')^2 \tag{2.21}$$

$$y_i = T_{324}\,x_0'\,y_0' \tag{2.22}$$


T122, T144, and T324 are matrix coefficients that represent second-order aperture 
aberrations; x₀′ and y₀′ represent the angular coordinates of an electron entering 
the spectrometer. For a fixed x₀′ and a range of y₀′ (or vice versa), the relationship 
between the image-plane coordinates xᵢ and yᵢ is a parabola. In practice, both 
x₀′ and y₀′ take a continuous range of values: −γ to +γ, where γ is the maximum 
entrance angle (defined by a spectrometer entrance aperture, for example). 
The image-plane intensity is then represented by the shaded area in Fig. 2.12. 
When the magnet is correctly aligned, this figure is symmetric about the vertical 
(x-) axis; second-order aberrations are properly corrected when its width in the 
x-direction is a minimum. Because the aberration figure has very small dimensions, 
it is difficult to observe unless the spectrometer is followed by magnifying electron 
lenses. 

Fig. 2.12 Aberration figures of a properly aligned magnetic prism whose energy resolution is 
determined by second-order aberration coefficients T122 = −A and T144 = −B that are (a) of the 
same sign (A > 0, B > 0) and (b) of opposite sign (A > 0, B < 0). The entrance angles x₀′ and y₀′ 
are assumed to be limited to values in the range −γ to +γ by a square entrance aperture; the result 
of a circular entrance aperture is similar except that the top of figure (a) is convex. Dotted lines 
indicate the position of the detector slit when recording an alignment figure. The matrix element 
T324 is denoted by F. From Egerton (1981b), copyright Elsevier 
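
The shape of the second-order aberration figure can be explored numerically by sampling entrance angles over a square aperture and applying Eqs. (2.21) and (2.22). In the sketch below the coefficient values are purely hypothetical (in metres per rad²) and the function names are our own.

```python
def aberration_figure(t122, t144, t324, gamma, n=41):
    """Image-plane points (x_i, y_i) from Eqs. (2.21)-(2.22), for entrance
    angles on an n x n grid spanning -gamma..+gamma (radians)."""
    angles = [-gamma + 2 * gamma * k / (n - 1) for k in range(n)]
    return [(t122 * xp ** 2 + t144 * yp ** 2, t324 * xp * yp)
            for xp in angles for yp in angles]

def extent(points):
    """Return (x_min, x_max, y_min, y_max) of a list of (x, y) points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return min(xs), max(xs), min(ys), max(ys)

GAMMA = 0.010  # 10 mrad maximum entrance angle
same_sign = aberration_figure(-1.0, -0.5, 0.8, GAMMA)   # A > 0, B > 0
opposite = aberration_figure(-1.0, 0.5, 0.8, GAMMA)     # A > 0, B < 0

for label, pts in (("same sign", same_sign), ("opposite sign", opposite)):
    x_min, x_max, y_min, y_max = extent(pts)
    print("%-14s x: %+.2e .. %+.2e m   y: %+.2e .. %+.2e m"
          % (label, x_min, x_max, y_min, y_max))
```

When both coefficients have the same sign the figure lies entirely on one side of the Gaussian image point, whereas with opposite signs it extends to both sides. In either case the total width in the dispersion direction is (|T122| + |T144|)γ², which is the quantity that limits the energy resolution.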


2.2.5.3 Alignment (Nonisochromaticity) Figure 


An alternative way of observing the aberration properties of a spectrometer is to 
place a narrow slit in its image plane and measure the electron flux through this 
slit by means of a single-channel detector (e.g., scintillator and photomultiplier), as 
in the case of serial recording of energy-loss spectra. Rather than scanning the exit 
beam across the slit, the entrance angle is varied by rocking the entrance beam about 
the spectrometer object point (Fig. 2.10). For a TEM fitted with a scanning attach- 
ment, the incident probe can be scanned over the specimen plane in the form of a 
two-dimensional raster; if the object plane of the spectrometer contains a diffrac- 
tion pattern of the specimen (at the projector lens cross-over, for example), the 
beam entering the spectrometer is swept in angle in both the x- and y-directions. 
Applying voltages proportional to x₀′ and y₀′ to the horizontal and vertical channels 
of an oscilloscope and using the signal from the electron detector to modulate the 
brightness of the oscilloscope beam (z-modulation), an alignment figure is obtained 
whose shape depends on the aberrations that directly affect the resolving power of 
the spectrometer. 

An electron arriving at the image plane will pass through the detection slit 
provided 


$$-\frac{s}{2} < x_i - \delta < \frac{s}{2} \tag{2.23}$$


where s is the slit width in the x-direction and δ specifies the position of the aberration 
figure relative to the slit; see Fig. 2.12. For the case where second-order 
aberrations are dominant, the shape of the alignment figure is specified by Eqs. 
(2.21) and (2.23). If δ = 0 and if T122 and T144 are both negative (as in the case of a 
typical straight-edged magnet), a solid ellipse is formed, whose dimensions depend 
on the values of T122, T144, and s (Fig. 2.13a). As the current in the spectrometer 
field coils is increased (δ > 0), the pattern shrinks inward and eventually disappears; 
if the spectrometer excitation is decreased, the pattern expands in outline but develops 
a hollow center. If T122 and T144 were both positive, this same sequence would 
be observed as the spectrometer current were decreased. 

When T122 and T144 are of opposite sign, the alignment figure consists of a pair
of hyperbolas (Fig. 2.13b). If T144 > 0, the hyperbolas come together as the
spectrometer current is increased and then separate in the y-direction, as in Fig. 2.14.
The sequence would be reversed if T122 were the positive coefficient.

When third-order aberrations are dominant, the aberration figure has three lobes 
(Fig. 2.15) but retains its mirror plane symmetry about the x-axis, in accordance 
with the symmetry of the magnet about the x-z plane.
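
The second-order behavior described above can be reproduced with a short numerical sketch. It assumes that the image-plane displacement is x_i = T122 (x_0′)² + T144 (y_0′)² (our reading of the second-order terms; Eq. (2.21) itself is not reproduced here) and that an electron is transmitted when |x_i − δ| < s/2. The function names, the character-based display, and all numerical values are illustrative assumptions rather than part of the original analysis.

```python
def passes_slit(x_angle, y_angle, t122, t144, delta, slit_width):
    """Return True if an electron entering at (x_angle, y_angle) passes the slit.

    Assumed model: x_image = t122 * x_angle**2 + t144 * y_angle**2, with the
    slit transmitting when |x_image - delta| < slit_width / 2.
    """
    x_image = t122 * x_angle ** 2 + t144 * y_angle ** 2
    return abs(x_image - delta) < slit_width / 2.0


def alignment_figure(t122, t144, delta, slit_width, half_range, n=21):
    """Rasterize the entrance angles and mark transmitted ('#') and blocked ('.') points."""
    step = 2.0 * half_range / (n - 1)
    rows = []
    for j in range(n):
        y_angle = half_range - j * step
        row = "".join(
            "#" if passes_slit(-half_range + i * step, y_angle,
                               t122, t144, delta, slit_width) else "."
            for i in range(n)
        )
        rows.append(row)
    return rows


if __name__ == "__main__":
    # Illustrative units only: angles in mrad, coefficients in um/mrad^2, slit in um.
    for delta in (-1.5, 0.0, 0.6):
        print(f"delta = {delta}")
        print("\n".join(alignment_figure(-1.0, -2.0, delta, 1.0, 1.5)))
```

With both coefficients negative, δ = 0 gives a filled ellipse, δ > s/2 gives an empty display, and δ < −s/2 gives an outline with a hollow center, in line with the sequence described in the text. Coefficients of opposite sign transmit along the asymptotes, giving the hyperbolic figure.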


[Figure 2.13 panels: (a) A > 0, B > 0, δ = 0; (b) A > 0, B < 0, δ = 0]

Fig. 2.13 Alignment figures of a magnetic prism whose energy resolution is limited by
second-order aberration coefficients (T122 = −A and T144 = −B) that are (a) of the same sign and (b) of
opposite sign. In (a), the scan range 2γ is assumed to be larger than the major axis of the ellipse.
From Egerton (1981b), copyright Elsevier

Fig. 2.14 Change in shape of the alignment figure as the spectrometer excitation is increased, for
the case T122 < 0 and T144 > 0. In (a), δ < 0; in (b), δ ≈ 0; in (c), δ > 0. From Egerton (1981b),
copyright Elsevier


Fig. 2.15 Calculated alignment figures for the magnetic prism spectrometer having (a) third-order
and residual second-order aberrations (with T122 and T144 of opposite sign), (b) fourth-order and
residual third-order aberrations, and (c) pure third-order aberrations. From Scheinfein and Isaacson
(1984), copyright SEM Inc., Illinois


Following from the above, some uses of the alignment figure are as follows. 


(1) In order to optimize the energy resolution, the spectrometer should be mechanically
or electrically aligned such that the figure is symmetrical about its x_0′ axis.
The most sensitive alignment is the tilt of the magnet about the exit-beam direc- 
tion but it is not easy to provide this rotation in the form of a single mechanical 
control. 

(2) The alignment figure enables the currents in multipole coils to be adjusted to 
compensate residual aberrations. To maximize the collection efficiency of the 
spectrometer for a given energy resolution, the currents should be adjusted so 
that the pattern is as large in area and as near-circular as possible. During this 
adjustment, it may be necessary to change the prism current to prevent the 
display from disappearing or developing a hollow center. 

(3) The symmetry of the alignment figure indicates the order of the uncorrected 
aberrations and the relative signs of the dominant aberration coefficients. The 
absolute signs can be deduced from the change in the pattern as the spectrometer
excitation is varied. The ratio of the coefficients can be estimated by measur- 
ing the aspect ratio (Fig. 2.13a) or angle between the asymptotes (Fig. 2.13b). 
Absolute magnitudes can be obtained if the display is calibrated in terms of 
entrance angle. 

If a spectrometer entrance aperture is inserted to limit the angular range of elec- 
trons entering the prism, its image should appear in outline on the display. To 
achieve the best combination of energy resolution and collection efficiency, the 
aperture is centered so that as little as possible of the alignment figure is cut off 
from the display. 

The influence of stray ac magnetic fields can be detected as a blurring or wavi- 
ness of the edges of the alignment figure. Imperfections in the energy-selecting 
slit (due to mechanical irregularity or contamination) show up as a streaking of 
the pattern. 


2.2.5.4 Stray-Field Compensation


Stray magnetic fields can easily affect the performance of an electron spectrometer. 
In the case of a magnetic prism attached to a conventional TEM, external fields can 
penetrate into the viewing chamber and deflect the electron beam before it enters the 
spectrometer. Slow changes in field are minimized by installing a field compensation
system and by ensuring that movable magnetic objects, such as steel chairs, are
replaced by nonmagnetic ones. Some of the external interference comes from mains 
frequency fields and can be compensated by a simple circuit that applies mains fre- 
quency current of adjustable amplitude and phase to the spectrometer excitation 
coils (Egerton, 1978b). External fields are less likely to be troublesome if the TEM 
viewing chamber is made of a magnetically shielding material (such as soft iron) or 
if the viewing chamber is eliminated, as in some recent TEM designs. 


2.3 The Use of Prespectrometer Lenses 


The single-prism electron spectrometer fitted to a conventional (fixed-beam) TEM 
is located below the imaging lenses, so electrons emerging from the specimen pass 
through these lenses before reaching the spectrometer. Not surprisingly, the per- 
formance of the EELS system (energy resolution, collection efficiency, and spatial 
resolution of analysis) is affected by the properties of the TEM imaging lenses and 
the way in which they are operated. 

The influence of TEM lenses on spectrometer performance was analyzed in a 
general way by Johnson (1980a, b), Egerton (1980a), and Krivanek et al. (1995). 
Good energy resolution requires that an electron-beam crossover of small diameter 
be placed at the spectrometer object plane. In practice, this crossover is either a low- 
magnification image of the specimen or a portion of its diffraction pattern (just the 
central beam, if a bright-field objective aperture is inserted). Since the spectrometer 
in turn images this crossover onto the EELS detector or the energy-selecting slit, 
what is actually recorded represents a convolution of the energy-loss spectrum with
the diffraction pattern or image of the specimen, sometimes called spectrum diffrac- 
tion or spectrum image mixing. In order to prevent diffraction or image information 
from seriously contaminating or distorting the energy-loss spectrum, the dimensions 
of the image or diffraction pattern (at the spectrometer object plane) must be made 
small relative to the energy dispersion. 

Some early spectrometer systems (Pearce-Percy, 1976; Joy and Maher, 1978; 
Egerton, 1978b) operated with the TEM projector lens turned off. Electrons were 
focused into a small crossover at the level of the TEM screen, which was also 
the spectrometer object plane. The region of specimen (diameter d) giving rise to 
the energy-loss spectrum was determined by the diameter of electron beam at the 
specimen or by inserting a selected area diffraction (SAD) aperture. The energy 
resolution available in this mode was analyzed by Johnson (1980a, b) and Egerton 
(1980a). 


2.3.1 TEM Imaging and Diffraction Modes 


Gatan spectrometers work with the projector lens on, as in normal TEM operation. 
The projector forms an optical crossover just below its lens bore, a distance h (typ- 
ically 30-40 cm) above the TEM viewing screen, and this crossover acts as the 
object point O of the spectrometer; see Fig. 2.16. Because the final TEM lens is 
designed to produce a large-diameter image or diffraction pattern, the solid angle
of divergence at O is large. Since electron optical brightness is conserved, the
angle-diameter product is constant; therefore, the crossover has a very small diameter.

Fig. 2.16 Simplified optics for the image and diffraction modes of a conventional TEM. S represents
the specimen; OBJ, INT, and PROJ represent the objective lens, intermediate-lens system,
and final imaging (projector) lens. O and VS are the spectrometer object point and viewing screen;
OA, SAD, and SEA are the objective, selected area diffraction, and spectrometer entrance apertures

If the TEM is operated in image mode, with an image of the specimen of magni- 
fication M on the viewing screen, the spectrometer is said to be diffraction coupled 
because the projector lens crossover then contains a small diffraction pattern of the 
specimen. The size of this diffraction pattern is represented by a camera length: 
L_o = h/M, and can be as small as 1 μm. The angular range of scattering allowed
into the spectrometer (the collection semi-angle β) is controlled by varying the size
of the objective lens aperture. The region of specimen giving rise to the energy-loss 
spectrum is determined by a spectrometer entrance aperture (SEA) and corresponds 
to a portion of the image close to the center of the TEM viewing screen (before the 
screen is lifted to allow electrons through to the spectrometer). More precisely, the 
diameter of analysis is d = 2R/M′, where R is the SEA radius and M′ = M(h′/h)
is the image magnification at the SEA plane, h′ being the height of the projector lens
crossover relative to the SEA. Because of the large depth of field, an image that is
in focus at the TEM screen is very nearly in focus at the SEA plane, so the SEA can 
act as an area-selecting aperture. 

If the TEM is operated in diffraction mode, with a diffraction pattern of camera 
length L at the viewing screen, the spectrometer is image coupled because the pro- 
jector crossover now contains an image of the illuminated area of the specimen. The 
image magnification at O is M_o = h/L and is typically of the order of 1. Unless the
objective aperture limits it to a smaller value, the collection semi-angle is β = R/L′,
where L′ = L(h′/h) is the camera length at the SEA plane. To ensure that the SEA
is centered on the optic axis, TEM diffraction shift controls have to be adjusted for 
maximum intensity of some sharp spectral feature. Alternatively, these controls can 
be used to select any desired region of the diffraction pattern for energy analysis. 
The area of specimen being analyzed is determined by the electron-beam diameter 
at the specimen or else by a selected area diffraction (SAD) aperture, if this aperture 
is inserted to define a smaller area. 
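
The first-order relations for the two coupling modes can be collected in a short sketch. It simply evaluates L_o = h/M, M′ = M(h′/h), and d = 2R/M′ for image mode, and M_o = h/L, L′ = L(h′/h), and β = R/L′ for diffraction mode. The function names and the numerical values (in particular h′ = 0.60 m and R = 1.5 mm) are hypothetical choices made only for illustration.

```python
def image_mode(h, h_prime, magnification, sea_radius):
    """TEM image mode (spectrometer diffraction coupled).

    Returns (camera length at O, diameter of analysis at the specimen).
    """
    camera_length_at_o = h / magnification            # L_o = h/M
    m_prime = magnification * h_prime / h             # M' = M (h'/h)
    analysis_diameter = 2.0 * sea_radius / m_prime    # d = 2R/M'
    return camera_length_at_o, analysis_diameter


def diffraction_mode(h, h_prime, camera_length, sea_radius):
    """TEM diffraction mode (spectrometer image coupled).

    Returns (image magnification at O, collection semi-angle set by the SEA).
    """
    magnification_at_o = h / camera_length            # M_o = h/L
    l_prime = camera_length * h_prime / h             # L' = L (h'/h)
    collection_semi_angle = sea_radius / l_prime      # beta = R/L'
    return magnification_at_o, collection_semi_angle


if __name__ == "__main__":
    # Hypothetical geometry: h = 0.35 m, h' = 0.60 m, SEA radius R = 1.5 mm.
    l_o, d = image_mode(0.35, 0.60, 350000.0, 1.5e-3)
    print(f"image mode: L_o = {l_o * 1e6:.2f} um, d = {d * 1e9:.1f} nm")
    m_o, beta = diffraction_mode(0.35, 0.60, 0.35, 1.5e-3)
    print(f"diffraction mode: M_o = {m_o:.2f}, beta = {beta * 1e3:.2f} mrad")
```

The collection semi-angle returned for diffraction mode applies only when the objective aperture does not impose a smaller limit.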

The above considerations are based on first-order geometric optics. Although 
some objective lenses are corrected for spherical aberration (Hawkes, 2008), most 
TEM imaging lenses suffer from spherical and chromatic aberrations, whose 
practical consequences we now discuss. 


2.3.2 Effect of Lens Aberrations on Spatial Resolution 


Because of chromatic aberration, a TEM image cannot be in focus for all energy 
losses. Most of this aberration occurs at the objective lens, where the image-plane 
angular divergence is higher than in subsequent lenses (Reimer and Kohl, 2008). 
If the objective (chromatic aberration coefficient C_c, magnification M_o) is focused
for zero-loss electrons, an electron with energy loss E and scattering angle θ arrives
at the first image plane with a radial displacement R = M_o θ Δf relative to the optic
axis, where Δf = C_c(E/E_0) and E_0 is the incident energy. Because R is proportional
to θ, the Lorentzian distribution of inelastic intensity dJ/dΩ per unit solid angle
(Chapter 3) gives rise to a Lorentzian distribution of intensity dJ/dA per unit area
in the image plane. The equivalent intensity at the specimen plane, the chromatic
point-spread function (PSF), is given by


$$\mathrm{PSF} \propto \left(r^2 + r_E^2\right)^{-1} \qquad (2.24)$$


Here r is a radial coordinate at the specimen and r_E = θ_E Δf, where θ_E ≈ E/(2E_0)
is the characteristic angle of inelastic scattering.

In the case of inner-shell energy losses, the values of E and r_E can be large.
However, a common procedure is to increase the TEM high voltage (by an amount
E_1/e) so that, for some energy loss E_1 within the recorded range, chromatic
aberration is zero (since these electrons have the same kinetic energy as the original
zero-loss electrons). Chromatic broadening is minimized if E_1 corresponds to the
center of the recorded range (width Δ), and for parallel-recording spectroscopy the
maximum broadening (at either end of the range) corresponds to Eq. (2.24) with


$$r_E = \theta_E\,\Delta f \approx \left(\frac{E}{2E_0}\right)\left(\frac{\Delta}{2}\right)\left(\frac{C_c}{E_0}\right) \qquad (2.25)$$
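
To give a feeling for the magnitudes involved, the following sketch compares the Lorentzian half-width r_E = θ_E Δf for an objective focused on the zero-loss electrons (Δf = C_c E/E_0) with the worst-case value when the high voltage is offset to the center of a recorded range of width Δ (Δf = C_c (Δ/2)/E_0). The function names and the example values (E = 500 eV, Δ = 200 eV, C_c = 2 mm, E_0 = 100 keV) are assumptions chosen for illustration.

```python
def characteristic_angle(energy_loss_ev, e0_ev):
    """Characteristic inelastic scattering angle, theta_E ~ E / (2 E_0), in rad."""
    return energy_loss_ev / (2.0 * e0_ev)


def r_e_zero_loss_focus(energy_loss_ev, e0_ev, cc):
    """r_E when the objective is focused for zero-loss electrons (defocus C_c E/E_0)."""
    defocus = cc * energy_loss_ev / e0_ev
    return characteristic_angle(energy_loss_ev, e0_ev) * defocus


def r_e_voltage_offset(energy_loss_ev, e0_ev, cc, range_width_ev):
    """Worst-case r_E at either end of a range of width Delta centered on E_1."""
    defocus = cc * (range_width_ev / 2.0) / e0_ev
    return characteristic_angle(energy_loss_ev, e0_ev) * defocus


if __name__ == "__main__":
    # Illustrative values: E = 500 eV, E_0 = 100 keV, C_c = 2 mm, Delta = 200 eV.
    print(r_e_zero_loss_focus(500.0, 1.0e5, 2.0e-3))        # metres
    print(r_e_voltage_offset(500.0, 1.0e5, 2.0e-3, 200.0))  # metres
```

For these assumed values the offset reduces r_E from about 25 nm to about 5 nm, because the relevant energy difference falls from E to Δ/2.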


In the case of energy-filtered (EFTEM) imaging (or serial EELS) with an
energy-selecting slit, the electron intensity is summed over the slit width Δ, which is small.
Then Eq. (2.24) can be integrated over energies within the slit to give (Egerton and
Crozier, 1997)


$$\mathrm{PSF} \propto \frac{1}{r}\left\{\tan^{-1}\!\left[\frac{r_c}{r}\,\frac{E}{2\beta E_0}\right] - \tan^{-1}\!\left[\frac{E}{2\beta E_0}\right]\right\} \qquad (2.26)$$


for |r| < r_c = (Δ/2)(βC_c/E_0) and zero otherwise. Here, β is the maximum scattering
angle contributing to the data, determined by an objective lens aperture. This
function (curves in Fig. 2.17) can be integrated over r and the radius r_50 containing
50% of the electrons is found to be typically four to eight times smaller than the total
radius r_c of the chromatic disk; see data points in Fig. 2.17. A computer program
is available to evaluate Eq. (2.26) and remove the chromatic spreading by deconvo- 
lution (Lozano-Perez and Titchmarsh, 2007). Quantum mechanical imaging theory 
suggests (Schenner et al., 1995) that the above geometrical optics analysis underes- 
timates the amount of blurring at low chromatic defocus because it does not include 
inelastic delocalization (Section 3.11). 
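
The relation between r_50 and r_c can be checked numerically. The sketch below evaluates the point-spread function in the form PSF ∝ (1/r){arctan[(r_c/r)E/(2βE_0)] − arctan[E/(2βE_0)]} for r < r_c, with r_c = (Δ/2)(βC_c/E_0), and integrates it over area with a simple midpoint rule. The function names and the core-loss energy E = 500 eV are assumptions; the other values follow the caption of Fig. 2.17. This is not the deconvolution program cited above, only a minimal illustration.

```python
import math


def chromatic_disk_radius(slit_width_ev, beta, cc, e0_ev):
    """Total radius of the chromatic disk, r_c = (Delta/2) * beta * C_c / E_0."""
    return 0.5 * slit_width_ev * beta * cc / e0_ev


def psf(r, r_c, energy_loss_ev, e0_ev, beta):
    """Unnormalized chromatic point-spread function for 0 < |r| < r_c.

    Returns 0 outside the disk. It also returns 0 at r = 0 as a placeholder:
    the true function diverges as 1/r there, but the singularity is integrable
    and the midpoint rule used below never samples it.
    """
    r = abs(r)
    if r == 0.0 or r >= r_c:
        return 0.0
    a = energy_loss_ev / (2.0 * beta * e0_ev)
    return (math.atan((r_c / r) * a) - math.atan(a)) / r


def radius_containing_half(r_c, energy_loss_ev, e0_ev, beta, n=20000):
    """Radius enclosing 50% of the intensity (midpoint rule over annuli)."""
    dr = r_c / n
    cumulative = []
    total = 0.0
    for k in range(n):
        r = (k + 0.5) * dr
        total += psf(r, r_c, energy_loss_ev, e0_ev, beta) * r * dr  # area weight r dr
        cumulative.append(total)
    for k, value in enumerate(cumulative):
        if value >= 0.5 * total:
            return (k + 1) * dr
    return r_c


if __name__ == "__main__":
    r_c = chromatic_disk_radius(20.0, 10e-3, 2e-3, 1e5)
    r_50 = radius_containing_half(r_c, 500.0, 1e5, 10e-3)
    print(f"r_c = {r_c * 1e9:.2f} nm, r_50 = {r_50 * 1e9:.2f} nm")
```

For the assumed energy loss, the ratio r_c/r_50 falls inside the range of four to eight quoted in the text.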

A similar geometric optics treatment of the effect of objective lens spherical
aberration (coefficient C_s) on a core-loss image gives


$$\mathrm{PSF} \propto \left[r^2 + \theta_E^2\,C_s^{2/3}\,r^{4/3}\right]^{-1} \qquad (2.27)$$


and r_50 values are typically 2-10% of the total radius r_s = C_s β³ (Egerton and
Crozier, 1997). Spherical and chromatic aberration produce less spatial broadening
when spectra or images are recorded from the valence-loss region, where E and
θ_E are much smaller. For a TEM in which spherical or chromatic aberration of the
imaging lenses is corrected, these sources of broadening would be absent.
Fig. 2.17 Chromatic aberration point-spread function for a core-loss image of a TEM specimen,
evaluated from Eq. (2.26) for C_c = 2 mm, β = 10 mrad, Δ = 20 eV, and E_0 = 100 keV. Open data
points give the fraction of electron intensity contained within a radius r


Equations (2.26) and (2.27) determine the spatial resolution not only of core-loss 
EFTEM images but also of energy-loss spectra, if an area-selecting aperture (e.g., 
spectrometer entrance aperture, for TEM image mode) is used to define the region of 
analysis. However, this region can instead be defined by the electron beam, as with a 
finely focused probe. The probe diameter is determined by the electron-source size, 
diffraction at the aperture of the probe-forming lens, and spherical and chromatic 
aberration of that lens. The chromatic broadening is 


$$\Delta r_c \approx C_c\,\alpha\,\Delta E_0/E_0 \qquad (2.28)$$


where ΔE_0 is the energy width of the illumination, often below 1 eV, and for a
strong probe-forming lens (C_c ≈ 2 mm) and α ≈ 10 mrad, Δr_c ≈ 0.2 nm, so
subnanometer probes are entirely practical. A similar argument applies to the resolution
of a STEM image. 
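
The estimate of about 0.2 nm can be reproduced directly from Δr_c ≈ C_c α ΔE_0/E_0. In the sketch below, the function name is hypothetical, and the values ΔE_0 = 1 eV and E_0 = 100 keV are assumptions (the text gives only an upper bound on the energy width and does not restate the incident energy).

```python
def probe_chromatic_broadening(cc, alpha, delta_e0_ev, e0_ev):
    """Chromatic broadening of a focused probe, Delta r_c ~ C_c * alpha * dE_0 / E_0."""
    return cc * alpha * delta_e0_ev / e0_ev


if __name__ == "__main__":
    # C_c = 2 mm, alpha = 10 mrad; assumed dE_0 = 1 eV and E_0 = 100 keV.
    broadening = probe_chromatic_broadening(2e-3, 10e-3, 1.0, 1e5)
    print(f"chromatic broadening = {broadening * 1e9:.2f} nm")
```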


2.3.3 Effect of Lens Aberrations on Collection Efficiency 


Fig. 2.18 Electron intensity at the plane of the spectrometer entrance aperture (SEA) for
microscope image mode (a)-(c) and diffraction mode (d). Chromatic aberration of post-specimen lenses
changes each solid profile into the dashed one. Shaded areas represent electrons that are rejected
by the SEA as a result of this aberration

When a conventional TEM operates in image mode, lens aberrations produce a
blurring of all image features, including the edge of the illumination disk. If the
diameter of illumination on the TEM screen is much larger than the diameter of the
spectrometer entrance aperture, this blurring occurs well outside the SEA perimeter
(see Fig. 2.18a) and will not affect the inelastic signal collected by the aperture.
Considering only the chromatic aberration (broadening r_c at the specimen plane, as
discussed in the last section), this condition requires that


$$M' r > R_a + M' r_c \qquad (2.29)$$


where M′ is the final magnification at the SEA plane, r is the radius of illumination
at the specimen, and R_a is the SEA radius. In other words, the magnified
radius of illumination must exceed the SEA radius by an amount at least equal to
the chromatic broadening in the image. Provided an objective aperture is used, r_c
is normally below 1 μm and Eq. (2.29) can be satisfied by adjusting the condenser
lenses so that the radius of illumination (at the TEM screen) is several times the SEA 
radius. Under these conditions, the loss of electrons (due to chromatic aberration) 
from points within the selected area is compensated by an equal gain from illumi- 
nated regions of specimen outside this area (Titchmarsh and Malis, 1989). However, 
this compensation is exact only if the current density is uniform within the disk of 
illumination and if the specimen is uniform in thickness and composition within this 
region. 

If the illumination is focused so that its screen-level radius becomes comparable 
to that of the spectrometer entrance aperture, part of the aberration tails can be cut 
off by the aperture (Fig. 2.18b) and the collection efficiency of the spectrometer will 
be reduced. Since r_c is a function of energy loss, the decrease in collection efficiency
due to chromatic aberration is energy dependent and will change the elemental ratio
deduced from measurements on two different ionization edges. 

If r_c and r are small enough, however, the aberration tails may occur within
the SEA (Fig. 2.18c) and the collection efficiency is unaffected. The necessary 
condition is 


$$M' r + M' r_c < R_a \qquad (2.30)$$


For R_a = 3 mm and r = r_c = 100 nm, Eq. (2.30) can be satisfied if M′ < 15,000,
assuming that the illumination can be accurately focused into the center of the 
spectrometer entrance aperture. 
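
The two limiting conditions can be expressed as a simple classification, taking them in the form M′r > R_a + M′r_c (aberration tails entirely outside the aperture) and M′r + M′r_c < R_a (tails entirely inside). The sketch below also recovers the limit M′ < 15,000 quoted above. The function names and return labels are hypothetical.

```python
def collection_regime(m_prime, r_illum, r_c, r_a):
    """Classify where the chromatic tails fall relative to the SEA.

    m_prime : magnification at the SEA plane
    r_illum : radius of illumination at the specimen (m)
    r_c     : chromatic broadening at the specimen (m)
    r_a     : SEA radius (m)
    """
    if m_prime * r_illum > r_a + m_prime * r_c:
        return "outside"   # broad illumination: signal inside the SEA unaffected
    if m_prime * r_illum + m_prime * r_c < r_a:
        return "inside"    # focused illumination: all tails accepted
    return "clipped"       # tails partly cut off: collection efficiency reduced


def max_magnification_for_inside(r_illum, r_c, r_a):
    """Largest M' for which the tails remain inside the SEA."""
    return r_a / (r_illum + r_c)


if __name__ == "__main__":
    print(max_magnification_for_inside(100e-9, 100e-9, 3e-3))
    print(collection_regime(1e4, 100e-9, 100e-9, 3e-3))
```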

Equations (2.29) and (2.30) represent conditions for getting no loss of collection
efficiency due to chromatic aberration. Because the radius r_50 containing 50%
of electrons is much less than r_c (see Fig. 2.17), the change in intensity and elemental
ratio should be small until the aperture radius is reduced to M′r_c/4 in most
cases.

If the TEM is operated in diffraction mode, chromatic aberration could change 
the distribution of inelastic intensity in the diffraction pattern and therefore the sig- 
nal collected by the spectrometer entrance aperture. Here the major chromatic effect 
is likely to arise from microscope intermediate lenses but should be significant only 
for energy losses above 500 eV and analyzed areas (defined by the incident beam 
or SAD aperture) larger than 3 μm in diameter (Yang and Egerton, 1992). Errors
in quantitative analysis should therefore be negligible in the case of sub-micrometer 
probes (Titchmarsh and Malis, 1989). This conclusion assumes that the imaging 
lenses are in good alignment, a condition that can be ensured by positioning the 
illumination (or SAD aperture) so that the voltage center of the diffraction pattern 
coincides with the center of the viewing screen. 

If the energy-loss spectrum is acquired by serial recording, chromatic aber- 
ration effects are avoided by keeping the spectrometer at a fixed excitation and 
scanning through energy loss by applying a ramp signal to the microscope high- 
voltage generator. Likewise for parallel recording, the microscope voltage can be 
raised by an amount equal to the energy loss of interest, but chromatic aberration 
will reduce the collection efficiency for energy losses that differ from this value. 
In other words, the lens system acts as a bandpass filter and can introduce artifacts 
in the form of broad peaks in the spectrum (Kruit and Shuman, 1985b), particu- 
larly if the optic axis does not coincide with the SEA center (Yang and Egerton, 
1992). 


2.3.4 Effect of TEM Lenses on Energy Resolution 


The resolution in an energy-loss spectrum depends on several factors: the energy 
spread ΔE_0 of the electrons before they reach the specimen (reflecting the energy
width of the electron source and the Boersch effect), broadening ΔE_so due to
the spectrometer, dependent on its electron optics, and the spatial resolution
s of the electron detector (or slit width for serial recording). Because these
components are independent, they can be added in quadrature, giving the measured
resolution ΔE as


$$(\Delta E)^2 \approx (\Delta E_0)^2 + (\Delta E_{so})^2 + (s/D)^2 \qquad (2.31)$$


where D is the spectrometer dispersion. In general, ΔE_so varies with energy loss E,
since both the spectrometer focusing and the angular width of inelastic scattering are 
E-dependent. As a result, the energy resolution at an ionization edge can be worse 
than that measured at the zero-loss peak. 
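
As a minimal numerical illustration of the quadrature sum, the sketch below combines a source energy spread, a spectrometer contribution, and a detector term s/D. The function name and all input values are assumptions chosen so that the three terms are of comparable size.

```python
import math


def measured_resolution(delta_e0_ev, delta_eso_ev, detector_resolution, dispersion):
    """Quadrature sum of source, spectrometer, and detector contributions (eV).

    detector_resolution and dispersion must use consistent units,
    e.g. metres and metres per eV.
    """
    detector_term = detector_resolution / dispersion
    return math.sqrt(delta_e0_ev ** 2 + delta_eso_ev ** 2 + detector_term ** 2)


if __name__ == "__main__":
    # Assumed values: 0.5 eV source spread, 0.3 eV spectrometer broadening,
    # 0.72 um detector resolution with a dispersion of 1.8 um/eV (0.4 eV).
    print(measured_resolution(0.5, 0.3, 0.72e-6, 1.8e-6))
```

Because the terms add in quadrature, the largest contribution dominates: halving a term that is already the smallest barely changes the measured resolution.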

The image produced by a double-focusing spectrometer is a convolution of the 
energy-loss spectrum with the image or diffraction intensity at the spectrometer 
object plane (the mixing effect referred to in Section 2.3). For an object of width d_o,
an ideal spectrometer with magnification M_x (in the direction of dispersion) would
produce an image of width M_x d_o. But if the spectrometer has aberrations of order n,
the image is broadened by an amount C_n γ^n, where γ is the divergence semi-angle of
the beam entering the spectrometer and the aberration coefficient C_n depends on the
appropriate nth-order matrix elements (Section 2.2.2). The spectrometer resolution
is then given by


$$(\Delta E_{so})^2 \approx (M_x d_o/D)^2 + (C_n \gamma^n/D)^2 \qquad (2.32)$$


The values of d_o and γ depend on how the TEM lenses are operated and on the
diameter of the spectrometer entrance aperture.

In the absence of an entrance aperture, the product d_o γ would be constant,
since electron-optical brightness is conserved (Reimer and Kohl, 2008). If the lens
conditions are changed so as to reduce d_o and therefore decrease the source size
contribution to ΔE_so, the value of γ and of the spectrometer aberration term must
increase, and vice versa. As a result, there is a particular combination of d_o and γ
that minimizes ΔE_so. This combination corresponds to optimum L_o or M_o at the
spectrometer object plane and to optimum values of M or L at the TEM viewing
screen; see Fig. 2.19.
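
The existence of an optimum can be made explicit. Writing the conserved product as k = d_o γ, the spectrometer resolution becomes (ΔE_so)² = (M_x k/(γD))² + (C_n γ^n/D)², and setting its derivative with respect to γ to zero gives γ_opt = [(M_x k)²/(n C_n²)]^(1/(2n+2)). The sketch below evaluates this expression; the function names, M_x = 1, and k = 10⁻⁹ m rad are assumptions, while C_3 = 50 m and D = 1.8 μm/eV are taken from the caption of Fig. 2.19.

```python
def spectrometer_resolution(gamma, k, m_x, c_n, n, dispersion):
    """Delta E_so (eV) for divergence gamma, with object size d_o = k / gamma."""
    d_o = k / gamma
    source_term = m_x * d_o / dispersion
    aberration_term = c_n * gamma ** n / dispersion
    return (source_term ** 2 + aberration_term ** 2) ** 0.5


def optimum_gamma(k, m_x, c_n, n):
    """Divergence semi-angle minimizing Delta E_so when d_o * gamma = k is fixed."""
    return ((m_x * k) ** 2 / (n * c_n ** 2)) ** (1.0 / (2 * n + 2))


if __name__ == "__main__":
    # Assumed: k = 1e-9 m rad (e.g. 1 um x 1 mrad), M_x = 1, third-order limited.
    k, m_x, c3, n, disp = 1e-9, 1.0, 50.0, 3, 1.8e-6
    g = optimum_gamma(k, m_x, c3, n)
    print(f"gamma_opt = {g * 1e3:.2f} mrad, d_o = {k / g * 1e6:.2f} um")
    print(f"Delta E_so = {spectrometer_resolution(g, k, m_x, c3, n, disp):.2f} eV")
```

At the optimum, the squared source-size term equals n times the squared aberration term, so for third-order aberrations the source-size contribution remains the larger of the two.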

The effect of a spectrometer entrance aperture (radius R_a, distance h′ below O)
is to limit γ to a value R_a/h′, so that the second term in Eq. (2.32) cannot exceed
a certain value. If the microscope is operated in image mode with the SEA defining
the area of analysis (M′d/2 > R_a), ΔE_so ≈ 1 eV for a spectrometer with
second-order aberrations corrected. But at very low screen magnifications or camera
lengths, the energy resolution degrades, due to an increase in spectrometer object
size; see Fig. 2.19. The only cure for this degradation is to reduce β (by using a small
objective aperture) in image mode or to reduce the diameter d of the analyzed region
in diffraction mode.

The above analysis neglects aberrations of the TEM lenses themselves, which 
could affect the energy resolution if a specimen image is present at the spectrometer 
object plane (Johnson, 1980a; Egerton, 1980a). In serial acquisition, TEM chromatic
aberration is avoidable by scanning the high voltage; for parallel recording,
it may be possible to bring each energy loss into focus by adjusting multipole
lenses associated with the spectrometer (equivalent to adjusting the tilt ψ of the
detector plane).

Fig. 2.19 Spectrometer resolution ΔE_so as a function of microscope magnification (for image
mode) or camera length (for diffraction mode), calculated for an illumination diameter of d = 1 μm
at the specimen, β = 10 mrad, and a spectrometer with C_2 = 0, C_3 = 50 m, and D = 1.8 μm/eV
at E_0 = 100 keV. Dotted lines correspond to the situation in which the diameter of illumination
(or the central diffraction disk) at the SEA plane exceeds the diameter (2R_a) of a 3-mm or a 5-mm
spectrometer entrance aperture


2.3.5 STEM Optics 


If energy-loss spectroscopy is carried out in a dedicated scanning transmission 
electron microscope (STEM), there need be no imaging lenses between the spectrometer
and specimen. However, the specimen is immersed in the field of the
probe-forming objective lens, whose post-field reduces the divergence of the electron
beam entering the spectrometer by an angular compression factor M = β/γ;
see Fig. 2.20a. This post-field creates a virtual image of the illuminated area of the 
specimen, which acts as the object point O for the spectrometer (Fig. 2.10). The 
spectrometer is therefore image coupled with an object-plane magnification equal 
to the angular compression factor. 

Because there are no image-plane (area-selecting) apertures, the spatial resolution
is defined by the incident probe diameter d. Although this diameter is affected
by spherical and chromatic aberration of the prefield, it is unaffected by aberration
coefficients of the post-field; therefore, the spatial resolution is independent of
energy loss. The focusing power of a magnetic lens is roughly proportional to the
reciprocal of the electron kinetic energy (Reimer and Kohl, 2008). Therefore the
angular compression of the post-field changes with energy loss, but only by about
1% per 1000 eV (for E_0 = 100 keV), so the corresponding variation in collection
efficiency is unimportant.

Fig. 2.20 (a) Geometry of a focused STEM probe, showing the angular compression (M = β/γ)
introduced by the post-field. Point O acts as a virtual object for the spectrometer system. (b)
Contributions to the energy resolution from spectrometer object size (SOS), from second- and
third-order spectrometer aberrations (SA2 and SA3), and from spherical (SPH) and chromatic
(CHR) aberrations, calculated as a function of M for γ = 5 mrad and E = 400 eV. Also shown
is the signal collection efficiency for E = 400 eV, assuming chromatic aberration is corrected
by prespectrometer optics. Instrumental parameters are for a VG-HB5 high-excitation polepiece
(C_s′ = 1.65 mm, C_c′ = 2 mm, d = 1 nm, E_0 = 100 keV) and an aberration-corrected spectrometer
with C_2 = 2 cm and C_3 = 250 cm, M_x = 0.73, D = 2 μm/eV (Scheinfein and Isaacson, 1984).
(c) Contributions to the energy resolution calculated for the case where M and γ are varied to
maintain β = 25 mrad. The dashed line shows the resolution of an equivalent straight-edge magnet
with uncorrected second-order aberrations (C_2 = 140 cm)

Most STEM instruments now have post-specimen lenses in order to further 
compress the angular range and provide greater flexibility of operation. Chromatic 
aberration in these lenses can cause the collection efficiency to increase or decrease 
with energy loss, creating errors in quantitative analysis (Buggy and Craven, 1981; 
Craven et al., 1981). 

The energy resolution of the spectrometer/post-field combination can be analyzed
as for a TEM with image coupling. The contribution M M_x d/D, representing
geometric source size, is negligible if the incident beam is fully focused (a
field-emission source allows values of d below 1 nm), but it becomes appreciable if
the probe is defocused to several hundred nanometers or scanned over a similar
distance with no descanning applied. Spherical and chromatic aberrations of the
objective post-field (coefficients C_s′ and C_c′) broaden the spectrometer source size,
resulting in contributions M M_x C_s′ β³/D and M M_x C_c′ β(E/E_0)/D, which become
significant if M exceeds 10; see Fig. 2.20b. Finally, spectrometer aberration terms, of
the form (C_n/D)(β/M)^n, can be made small by appropriate spectrometer design.
Good resolution is possible with high collection efficiency (β = 25 mrad) if M
is of the order of 10; see Fig. 2.20c. Correction of spherical aberration of the 
imaging lenses permits a collection angle of more than 100 mrad (Botton et al., 
2010). 
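
The individual contributions can be tabulated as a function of the angular compression M. The sketch below assumes the terms take the forms M M_x d/D (object size), M M_x C_s′ β³/D (spherical), M M_x C_c′ β (E/E_0)/D (chromatic), and (C_n/D)(β/M)^n (spectrometer aberrations), which is our reading of the expressions in the text. The parameter values follow the caption of Fig. 2.20, with β held at 25 mrad; the function name and dictionary keys are hypothetical, and no attempt is made to reproduce the published curves exactly.

```python
def stem_resolution_terms(m, beta, energy_loss_ev, e0_ev, d, m_x, dispersion,
                          cs_post, cc_post, c2, c3):
    """Energy-resolution contributions (eV) for angular compression m.

    All lengths are in metres, angles in radians, dispersion in metres per eV.
    """
    return {
        "SOS": m * m_x * d / dispersion,
        "SPH": m * m_x * cs_post * beta ** 3 / dispersion,
        "CHR": m * m_x * cc_post * beta * (energy_loss_ev / e0_ev) / dispersion,
        "SA2": (c2 / dispersion) * (beta / m) ** 2,
        "SA3": (c3 / dispersion) * (beta / m) ** 3,
    }


if __name__ == "__main__":
    for m in (2, 5, 10, 20):
        terms = stem_resolution_terms(
            m, beta=25e-3, energy_loss_ev=400.0, e0_ev=1e5, d=1e-9, m_x=0.73,
            dispersion=2e-6, cs_post=1.65e-3, cc_post=2e-3, c2=2e-2, c3=2.5)
        summary = ", ".join(f"{name} = {value:.3g}" for name, value in terms.items())
        print(f"M = {m}: {summary}")
```

The post-field terms grow linearly with M while the spectrometer aberration terms fall as M⁻ⁿ, which is why an intermediate compression gives the best overall resolution.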

Correction of spherical aberration of the probe-forming lens has resulted in probe 
diameters below 0.1 nm but with significant increase in the incident beam conver- 
gence, which places extra demands on the performance of the spectrometer and 
post-specimen optics. Quadrupole and octupole elements have recently been used 
to correct the remaining aberrations of the Enfina spectrometer, besides allowing 
the camera length of recorded diffraction patterns to be adjusted (Krivanek et al., 
2008). 


2.4 Recording the Energy-Loss Spectrum 


The energy-loss spectrum is recorded electronically as a sequence of channels, the 
electron intensity in each channel being represented by a number stored in computer 
memory. Historically, there have been two strategies for converting the intensity 
distribution into stored numbers. 

In parallel recording, a position-sensitive electron detector simultaneously 
records all of the incident electrons, resulting in relatively short recording times and
therefore minimizing drift and radiation damage to the specimen during spectrum acquisition.
The original parallel-recording device was photographic film, whose optical-density 
distribution (after chemical development) could be digitized in a film scanner. 
Position-sensitive detectors (based on silicon diode arrays) now provide a more 
convenient option; the procedures involved are discussed in Section 2.5. 

In serial recording, the spectrum at the image plane of the electron spectrometer 
is recorded by scanning it across a narrow slit placed in front of a single-channel 
electron detector. Because electrons intercepted by the slit are wasted, this method 
is inefficient, requiring longer recording times to avoid excessive statistical (shot) 
noise at high energy loss. But because the same detector is used to record all energy- 
loss channels, serial recording avoids certain artifacts (interchannel coupling and 
gain variations) that arise in parallel recording and is adequate or even preferable 
for recording low-loss spectra. 

Regardless of the recording system employed, it is often necessary to scan or 
shift the spectrum to record different ranges of energy loss, and various ways of 
doing this are discussed in Section 2.4.1. Electron scattering in front of the detector,
leading to a “spectrometer background,” is discussed in Section 2.4.2. The technique 
of coincidence counting, which can reduce the background to core-loss edges, is 
outlined in Section 2.4.3. 


2.4.1 Spectrum Shift and Scanning 


Several methods are available for scanning the energy-loss spectrum across an 
energy-selecting slit (as required for serial recording or EFTEM imaging) or for 
shifting it relative to the detector (often necessary in parallel recording). 


(a) Ramping the magnet. The magnetic field in a single-prism spectrometer can be 
changed by varying the main excitation current or by applying a current ramp 
to a separate set of window-frame coils. Because the detector is stationary, the 
recorded electrons always have the same radius of curvature within the spec- 
trometer. From Eq. (2.1), the magnetic induction required to record electrons of 
energy E_0 is

$$B = \gamma m_0 v/(eR) = (ecR)^{-1} E_0 \left(1 + 2 m_0 c^2/E_0\right)^{1/2} \qquad (2.33)$$


where m_0c² = 511 keV is the electron rest energy. Even assuming ideal properties
of the magnet (B ∝ ramp current), a linear ramp provides an energy axis
that is slightly nonlinear due to the square-root term in Eq. (2.33); see Fig. 2.21.
The nonlinearity is only about 0.5% over a 1000-eV scan (at E_0 = 100 keV)
and can be avoided by using a digitally programmed power supply. 

(b) Pre- or post-spectrometer deflection. Inductance of the magnet windings causes 
a lag between the change in B and the applied voltage, limiting the scan rate to 
typically 1 per second. Higher rates, which are convenient for adjusting the 
position of the zero-loss peak and setting the width of the detection slit, are 
achieved by injecting a ramp signal into dipole coils located just before or after 
the spectrometer. Even faster deflection is possible by using electrostatic deflection 
plates (Fiori et al., 1980; Craven et al., 2002) and this technique has enabled 
near-simultaneous recording of low-loss and core-loss spectra on the same CCD 
camera (Scott et al., 2008; Gubbens et al., 2010). 

Fig. 2.21 Spectrometer current $I$ (relative to its value for 100-keV electrons) for a fixed bend radius 
$R$, as a function of the primary energy $E_0$. Also shown are the change $\Delta I$ and fractional change $\Delta I/I$ 
in current required to compensate for a 1000-eV change in energy of the detected electrons, from 
which the incident energy $E_0$ can be measured. From Meyer et al. (1995), copyright Elsevier 

(c) Ramping the high voltage. The spectrum can also be shifted or scanned by 
applying a signal to the feedback amplifier of the microscope’s high-voltage 
generator, thereby changing the incident electron kinetic energy $E_0$. The spectrometer 
and detection slit then act as an energy filter that transmits electrons 
of a fixed kinetic energy, thereby minimizing the unwanted effects of chromatic 
aberration in the post-specimen lenses (Section 2.3). The difference in energy of 
the electrons passing through the condenser lenses results in a change in illumi- 
nation focus, but this effect can be compensated by applying a suitable fraction 
of the scanning signal to the condenser lens power supply (Wittry et al., 1969; 
Krivanek et al., 1992). 

(d) Drift-tube scanning. If the flight tube of a magnetic spectrometer is electrically 
isolated from ground, applying a voltage to it changes the kinetic energy 
of the electrons traveling through the magnet and shifts the energy-loss scale. 
The applied potential also produces a weak electrostatic lens at the entrance 
and exit of the drift tube, tending to defocus and possibly deflect the spectrum, 
but these effects appear to be negligible provided the internal diameter 
of the drift tube and its immediate surroundings are not too small (Batson et al., 
1981). In the absence of an electrostatic lens effect, a given voltage applied to 
the drift tube will displace the energy-loss spectrum by the same number of 
electron volts, allowing the energy-loss axis to be accurately calibrated. Meyer 
et al. (1995) have shown that when a known voltage (e.g., 1000 V) is applied 
to the drift tube and the zero-loss peak is returned to its original position by 
changing the magnet current $I$ by an amount $\Delta I$, a measurement of $\Delta I/I$ allows 
the accelerating voltage to be determined to an accuracy of about 50 V; see 
Fig. 2.21. 
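
The relationships behind methods (a) and (d) can be illustrated numerically. The following Python sketch evaluates the energy-dependent factor of Eq. (2.33), the change in slope dB/dE across a scan, and the fractional current change that follows a known energy offset; it then inverts that fractional change to recover the primary energy by bisection. It assumes that B is strictly proportional to the magnet current and that any electrostatic lens effect is negligible. The function names and the bisection limits are illustrative choices, not part of any published procedure.

```python
import math

M0C2_EV = 510_998.95  # electron rest energy in eV (about 511 keV)


def field_factor(e_ev):
    """Energy-dependent factor of Eq. (2.33): E*(1 + 2*m0c^2/E)**0.5, in eV.

    The induction itself is B = field_factor / (e*c*R).
    """
    return e_ev * math.sqrt(1.0 + 2.0 * M0C2_EV / e_ev)


def field_slope(e_ev):
    """Analytic derivative of field_factor with respect to energy."""
    return (e_ev + M0C2_EV) / field_factor(e_ev)


def slope_change_over_scan(e0_ev, scan_ev):
    """Fractional change in dB/dE between the two ends of a scan."""
    return (field_slope(e0_ev - scan_ev) - field_slope(e0_ev)) / field_slope(e0_ev)


def fractional_current_change(e0_ev, delta_ev):
    """Delta I / I that follows a delta_ev change in electron energy (B ~ I assumed)."""
    return (field_factor(e0_ev + delta_ev) - field_factor(e0_ev)) / field_factor(e0_ev)


def e0_from_current_change(ratio, delta_ev, lo=1.0e4, hi=1.0e6):
    """Recover E0 (eV) from a measured Delta I / I by bisection.

    The ratio falls monotonically as E0 rises, so a larger ratio than the
    measured one means the trial energy is too low.
    """
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if fractional_current_change(mid, delta_ev) > ratio:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

With these assumptions, the slope dB/dE at 100 keV changes by roughly 0.4% over a 1000-eV scan, the same order as the nonlinearity quoted under (a), and a 1000-eV offset corresponds to a fractional current change of about 0.54%.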


Particularly where long recording times are necessary, it is convenient to add several 
readouts in computer memory, a technique sometimes called multiscanning or 
signal averaging. The accumulated data can be regularly displayed, allowing broad 
features to be discerned after only a few scans, so that the acquisition can be aborted 
if necessary. Otherwise, the readouts are repeated until the signal/noise ratio (SNR) 
of the data becomes acceptable. For a given total time T of acquisition, the SNR 
is similar to that for a spectrum acquired in a single scan of duration T but the 
effect of instrumental instability is different. Any drift in high voltage or prism 
current can be largely eliminated by shifting individual readouts so that a 
particular spectral feature (e.g., the zero-loss peak) always occurs at the same spectral 
channel (Batson et al., 1971; Egerton and Kenway, 1979; Kimoto and Matsui, 
2002). 
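
The drift correction described above can be sketched in a few lines of Python. This is a minimal illustration, assuming that each readout is a list of counts per channel and that the strongest channel in every readout is the zero-loss peak; the function name and the whole-channel shift are hypothetical simplifications (a practical implementation might fit the peak position to sub-channel accuracy).

```python
def align_and_sum(readouts, ref_channel=None):
    """Sum several readouts after shifting each one so that its strongest
    channel (assumed to be the zero-loss peak) lands on ref_channel."""
    if not readouts:
        raise ValueError("no readouts supplied")
    n = len(readouts[0])
    if ref_channel is None:
        # Use the peak position of the first readout as the reference.
        ref_channel = max(range(n), key=readouts[0].__getitem__)
    total = [0.0] * n
    for readout in readouts:
        if len(readout) != n:
            raise ValueError("all readouts must have the same length")
        peak = max(range(n), key=readout.__getitem__)
        shift = ref_channel - peak
        for channel, counts in enumerate(readout):
            dest = channel + shift
            if 0 <= dest < n:  # channels shifted past either end are dropped
                total[dest] += counts
    return total
```

Because channels shifted beyond the ends of the array are discarded, the extreme channels of the summed spectrum contain fewer readouts than the central ones.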


2.4.2 Spectrometer Background 


The energy-loss spectrum of a thin specimen covers a large dynamic range, with 
the zero-loss peak having the highest intensity (e.g., Fig. 1.3). As a result, the 
zero-loss electrons can produce an observable effect when the zero-loss peak is 
at some distance from the detector. Stray electrons are generated by backscatter- 
ing from whatever surface absorbs the zero-loss beam and some of these find their 
way (by multiple backscattering) to the electron detector and generate a spectrom- 
eter background that typically varies slowly with energy loss and is therefore more 
noticeable at higher energy loss. In the Gatan parallel-detection system, a beam trap 
is used to minimize this backscattering, resulting in a background that (in terms of 
intensity per eV) is over $10^6$ times smaller than the integrated zero-loss intensity. 

Because most of the stray electrons and x-rays are generated by the zero- and 
low-loss electrons, an instrumental background similar to that present in a real spec- 
trum can be measured by carrying out serial acquisition with no specimen in the 
beam. The resulting background spectrum enables the spectrometer contribution to 
be assessed and if necessary subtracted from real spectral data (Craven and Buggy, 
1984). 

The magnitude of the spectrometer background can also be judged from the 
jump ratio of an ionization edge, defined as the ratio of maximum and minimum 
intensities just after and just before the edge. A very thin carbon foil (<10 nm for 
$E_0$ = 100 keV) provides a convenient test sample (Joy and Newbury, 1981); if the 
spectrum is recorded with a collection semi-angle $\beta$ less than 10 mrad, a jump ratio 
of 15 or more at the K-edge indicates a low instrumental background (Egerton and 
Sevely, 1983). 
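
As an illustration of this test, the sketch below measures a jump ratio from a list of (energy, counts) values and shows how a flat instrumental background lowers it. The window width, the function names, and the idea of expressing the background as a fraction of the pre-edge intensity are assumptions made for the example.

```python
def jump_ratio(energies, counts, edge_ev, window_ev):
    """Ratio of the maximum intensity just after the edge to the minimum
    intensity just before it, within window_ev on either side."""
    before = [c for e, c in zip(energies, counts)
              if edge_ev - window_ev <= e < edge_ev]
    after = [c for e, c in zip(energies, counts)
             if edge_ev <= e <= edge_ev + window_ev]
    if not before or not after:
        raise ValueError("window contains no data on one side of the edge")
    return max(after) / min(before)


def jump_ratio_with_background(true_ratio, background_fraction):
    """Jump ratio observed when a flat spectrometer background, equal to
    background_fraction times the pre-edge intensity, is added to both
    the pre-edge and post-edge intensities."""
    return (true_ratio + background_fraction) / (1.0 + background_fraction)
```

For example, a background equal to the pre-edge intensity reduces a true jump ratio of 15 to (15 + 1)/(1 + 1) = 8, which is why a low measured ratio points to a significant spectrometer contribution.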

A further test for spectrometer background is to record the K-ionization edge of a 
thin (<50 nm) aluminum or silicon specimen in the usual bright-field condition (col- 
lection aperture centered about the optic axis) and in dark field, where the collection 
aperture is shifted or the incident illumination tilted so that the central undiffracted 
beam is intercepted by the aperture. In the latter case, stray electrons and x-rays 
are generated mainly at the collection aperture and have little chance of reaching 
the detector, particularly if an objective-lens aperture acts as the collection aperture. 
As a result, the jump ratio of the edge may be higher in dark field, the amount of 
improvement reflecting the magnitude of the bright-field spectrometer background 
(Oikawa et al., 1984; Cheng and Egerton, 1985). If the spectrometer background 
is high, the measured jump ratio may actually increase with increasing specimen 
thickness (Hosoi et al., 1984; Cheng and Egerton, 1985), up to a thickness at which 
plural scattering in the specimen imposes an opposite trend (Section 3.5.4). 

Although not usually a problem, electron scattering within the microscope col- 
umn can also contribute to the instrumental background. For example, insertion of 
an area-selecting aperture has been observed to degrade the jump ratio of an edge, 
presumably because of scattering from the edge of the aperture (Joy and Maher, 
1980a). 


2.4.3 Coincidence Counting 


If an energy-dispersive x-ray (EDX) detector is operated simultaneously with an 
energy-loss spectrometer, it is possible to improve the signal/background and (in 
principle) the signal/noise ratio of an ionization edge. By applying both detector 
outputs to a gating circuit that gives an output pulse only when an energy-loss elec- 
tron and a characteristic x-ray photon of the same energy are received within a 
given time interval, the background to an ionization edge can be largely eliminated 
(Wittry, 1976). Some “false coincidences” occur, due to x-rays and energy-loss elec- 
trons generated in separate scattering events; their rate is proportional to the product 
of resolution time, x-ray signal, and energy-loss signal. To keep this contribution 
small, the incident beam current must be kept low, resulting in a low overall count 
rate. A small contribution from false coincidences can be recognized, since it has 
the same energy dependence as the energy-loss signal before coincidence gating, 
and subtracted (Kruit et al., 1984). A peak due to bremsstrahlung loss also appears 
in the coincidence energy-loss spectrum, at an energy just below the ionization-edge 
threshold (Fig. 2.22). 
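
The trade-off between beam current and false coincidences can be made concrete with a short sketch. It assumes that the x-ray, energy-loss, and true-coincidence rates all scale linearly with beam current and takes the constant of proportionality in the false-coincidence rate as unity; the parameter names and the numbers in the usage note are purely illustrative.

```python
def coincidence_rates(beam_current, xray_per_unit, eels_per_unit,
                      true_per_unit, resolving_time_s):
    """Return (true_rate, false_rate) in counts per second.

    The *_per_unit arguments are count rates per unit beam current.
    The false rate is taken as resolving_time * x-ray rate * energy-loss rate.
    """
    xray_rate = xray_per_unit * beam_current
    eels_rate = eels_per_unit * beam_current
    true_rate = true_per_unit * beam_current
    false_rate = resolving_time_s * xray_rate * eels_rate
    return true_rate, false_rate
```

With hypothetical values of 1000 x-ray counts and 10^5 energy-loss counts per second per unit current, 100 true coincidences per second per unit current, and a resolving time of 100 ns, unit current gives 100 true and 10 false counts per second. Reducing the current tenfold lowers the true rate to 10 per second but the false rate to 0.1 per second, since the latter varies as the square of the current.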

Measured coincidence rates have amounted to only a few counts per second 
(Kruit et al., 1984; Nicholls et al., 1984) but with recent improvements in the 
collection efficiency of EDX detectors, coincidence counting could become useful 
and routine. Ideally, the energy-loss spectrum would be recorded with a 
parallel-detection system operating in an electron counting mode. 

Fig. 2.22 (a) Scheme for simultaneous measurement of x-ray emission and coincident energy- 
loss electrons, here recorded serially. The energy window $\Delta_x$ for x-ray gating is selected by a 
dual-level discriminator. (b) The coincidence energy-loss spectrum contains a background FC due 
to false coincidences and a small peak BR arising from bremsstrahlung energy losses at the x-ray 
gating window, here chosen to match the lowest energy x-ray peak 

Particularly for light element detection, the x-ray detector could be replaced 
by an Auger electron detector as the source of the gating signal (Wittry, 1980). 
Alternatively, by using the energy-loss signal for gating, coincidence counting could 
be used to improve the energy resolution of the Auger spectrum (Cazaux, 1984; 
Haak et al., 1984). Coincidence between energy-loss events and secondary elec- 
tron generation has yielded valuable information about the mechanism of secondary 
electron emission (Pijper and Kruit, 1991; Müllejans et al., 1993; Scheinfein et al., 
1993). 


2.4.4 Serial Recording of the Energy-Loss Spectrum 


A serial recording system contains four components: (1) the detection slit, which 
selects electrons of a particular energy loss; (2) the electron detector; (3) a method 
of scanning the loss spectrum across the detection slit; and (4) a means of converting 
the output of the electron detector into binary numbers for electronic storage. These 
components are shown schematically in Fig. 2.23 and discussed below. 

Fig. 2.23. A serial-acquisition system for energy-loss spectroscopy. An energy-selecting slit is 
located in the spectrometer image plane. Electrons that have passed through the slit cause lumi- 
nescence in a scintillator; the light produced is turned into an electrical signal and amplified by 
a photomultiplier tube (PMT). The PMT output is fed into a multichannel scaling (MCS) circuit 
whose ramp output scans the spectrum across the slit, the value of resistor R, determining the scan 
range and electron volt per channel 


2.4.4.1 Design of a Detection Slit 


Because the energy dispersion of an electron spectrometer is only a few micrometers 
per electron volt at 100 keV primary energy, the edges of any energy-selecting slit 
must be smooth (on a µm scale) over the horizontal width of the electron beam at 
the spectrometer image plane. For a double-focusing spectrometer, this width is in 
principle very small, but stray magnetic fields can result in appreciable y-deflection 
and cause a modulation of the detector output if the edges are not sufficiently straight 
and parallel (Section 2.2.5). The slit blades are usually constructed from materials 
such as brass or stainless steel, which are readily machinable, although gold-coated 
glass fiber has also been used (Metherell, 1971). 

The slit width s (in the vertical direction) is normally adjustable to suit different 
circumstances. For examining fine structure present in the loss spectrum, a small 
value of s ensures good energy resolution of the recorded data. For the measurement 
of elemental concentrations from ionization edges, a larger value may be needed to 
obtain adequate signal ($\propto s$) and signal/noise ratio. For large s, the energy resolution 
becomes approximately s/D, where D is the dispersive power of the spectrometer. 

The slit blades are designed so that electrons which strike them produce negligi- 
ble response from the detector. It is relatively easy to prevent the incident electrons 
being transmitted through the slit blades, since the stopping distance (electron range) 
of 100-keV electrons is less than 100 µm for solids with atomic number greater than 
14. On the other hand, x-rays generated when an electron is brought to rest are more 
penetrating: in iron or copper, the attenuation length of 60-keV photons is about 
1 mm. Transmitted x-rays that reach the detector give a spurious signal (Kihn et al., 
1980) that is independent of the slit opening s. In addition, half of the x-ray photons 
are generated in the backward direction and a small fraction of these are reflected 
so that they pass through the slit to the detector (Craven and Buggy, 1984). Even 
more important, an appreciable number of fast electrons that strike the slit blades 
are backscattered and after subsequent backreflection may pass through the slit and 
arrive at the detector. Coefficients of electron backscattering are typically in the 
range of 0.1 to 0.6, so the stray-electron signal can be appreciable. 

The stray electrons and x-rays are observed as a spectrometer background to 
the energy-loss spectrum, resulting in reduced signal/background and signal/noise 
ratio at higher energy loss. Most of the energy-loss intensity occurs within the low- 
loss region, so most of the stray electrons and x-rays are generated close to the 
point where the zero-loss beam strikes the “lower” slit blade when recording higher 
energy losses. 

Requirements for a low spectrometer background are therefore as follows. The 
slit material should be conducting (to avoid electrostatic charging in the beam) and 
thick enough to prevent x-ray penetration. For 100-keV operation, 5-mm thickness 
of brass or stainless steel appears to be sufficient. The angle of the slit edges should 
be close to 80° so that the zero-loss beam is absorbed by the full thickness of the 
slit material when recording energy losses above a few hundred electron volts. The 
defining edges should be in the same plane so that, when the slit is almost closed to 
the spectrometer exit beam, there is no oblique path available for scattered electrons 
(traveling at some large angle to the optic axis) to reach the detector. The length 
of the slit in the horizontal (nondispersive) direction should be restricted, to reduce 
the probability of scattered electrons and x-rays reaching the detector. A length of 
a few hundred micrometers may be necessary to facilitate alignment and allow for 
deflection of the beam by stray dc and ac magnetic fields. Since the coefficient 
$\eta$ of electron backscattering is a direct function of atomic number, the “lower” 
slit blade should be coated with a material such as carbon ($\eta \approx 0.05$). The easiest 
procedure is to “paint” the slit blades with an aqueous suspension of colloidal 
graphite, whose porous structure helps further in absorbing scattered electrons. 
The lower slit blade should be flat within the region over which the zero-loss beam 
is scanned. Sharp steps or protuberances can give rise to sudden changes in the scat- 
tered electron background, which could be mistaken for real spectral features (Joy 
and Maher, 1980a). For a similar reason, the use of “spray” apertures in front of the 
energy-selecting slit should be avoided. 
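
The thickness requirement can be checked with a rough estimate. The sketch below assumes simple exponential attenuation and applies the 1-mm attenuation length quoted earlier for 60-keV photons in iron or copper to the slit material; both are simplifications, since the bremsstrahlung spectrum is continuous and brass or stainless steel will differ somewhat. The function names are hypothetical.

```python
import math


def xray_transmission(thickness_mm, attenuation_length_mm=1.0):
    """Fraction of photons transmitted through a blade of given thickness,
    assuming exponential attenuation with a single attenuation length."""
    return math.exp(-thickness_mm / attenuation_length_mm)


def thickness_for_transmission(fraction, attenuation_length_mm=1.0):
    """Blade thickness (mm) that reduces transmission to the given fraction."""
    return -attenuation_length_mm * math.log(fraction)
```

On this estimate, a 5-mm blade transmits a fraction exp(-5), below 1%, of 60-keV photons, consistent with the statement that 5 mm appears to be sufficient for 100-keV operation.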

Moving the detector further away from the slits and minimizing its exposed area 
(just sufficient to accommodate the angular divergence of the spectrometer exit 
beam) decrease the fraction of stray electrons and x-rays that reach the detector 
(Kihn et al., 1980). Employing a scintillator whose thickness is just sufficient to stop 
fast electrons (but not hard x-rays) will further reduce the x-ray contribution. Finally, 
the use of an electron counting technique for the higher energy losses reduces the 
intensity of the instrumental background because many of the backscattered elec- 
trons and x-rays produce output pulses that fall below the threshold level of the 
discriminator circuit (Kihn et al., 1980). 


2.4.4.2 Electron Detectors for Serial Recording 


Serial recording has been carried out with solid-state (silicon diode) detectors but 
counting rates are limited (Kihn et al., 1980) and radiation damage is a potential 
problem at high doses (Joy, 1984a). Windowless electron multipliers have been used 
up to 100 Mcps but require excellent vacuum to prevent contamination (Joy, 1984a). 
Channeltrons give a relatively noisy output for electrons whose energy exceeds 
10 keV (Herrmann, 1984). For higher energy electrons, the preferred detector for 
serial recording consists of a scintillator (emitting visible photons in response to 
each incident electron) and a photomultiplier tube (PMT), which converts some of 
these photons into electrical pulses or a continuous output current. 


2.4.4.3 Scintillators 


Properties of some useful scintillator materials are listed in Table 2.2. 
Polycrystalline scintillators are usually prepared by sedimentation of a phosphor 
powder from aqueous solution or an organic liquid such as chloroform, sometimes 
with an organic or silicate binder to improve the adhesive properties of the layer. 
Maximizing the signal/noise ratio requires an efficient phosphor of suitable thick- 
ness. If the thickness is less than the electron range, some kinetic energy of the 
incident electron is wasted; if the scintillator is too thick, light may be lost by 

Table 2.2 Properties of several scintillator materials* 

Material    Type         Peak             Principal decay   Energy conversion   Dose for
                         wavelength (nm)  constant (ns)     efficiency (%)      damage (Mrad)
NE 102      Plastic      420              2.4, 7            3                   1
NE 901      Li glass     395              75                1                   10^3
ZnS(Ag)     Polycrystal  450              200               12
P-47        Polycrystal  400              60                7                   10^2
P-46        Polycrystal  550              70                3                   >10^4
CaF2(Eu)    Crystal      435              1000              2                   10^4
YAG         Crystal      560              80                                    >10^4
YAP         Crystal      380              30                7                   >10^4


*P-46 and P-47 are yttrium aluminum garnet (YAG) and yttrium silicate, respectively, each doped 
with about 1% of cerium. The data were taken from several sources, including Blasse and Bril 
(1967), Pawley (1974), Autrata et al. (1983), and Engel et al. (1981). The efficiencies and radiation 
resistance should be regarded as approximate 


absorption or scattering within the scintillator, especially in the case of a transmis- 
sion screen (Fig. 2.23). The optimum mass thickness for P-47 powder appears to be 
about 10 mg/cm^2 for 100-keV electrons (Baumann et al., 1981). 

To increase the fraction of light entering the photomultiplier and prevent electro- 
static charging, the entrance surface of the scintillator is given a reflective metallic 
coating. Aluminum can be evaporated directly onto the surface of glass, plastic, and 
single-crystal scintillators. In the case of a powder-layer phosphor, the metal would 
penetrate between the crystallites and reduce the light output, so an aluminum film 
is prepared separately and floated onto the phosphor or else the phosphor layer is 
coated with a smooth layer of plastic (e.g., collodion) before being aluminized. 

The efficiency of many scintillators decreases with time as a result of irradiation 
damage (Table 2.2). This process is particularly rapid in plastics (Oldham et al., 
1971), but since the electron penetrates only about 100 µm, the damaged layer can 
be removed by grinding and polishing. In the case of inorganic crystals, the loss 
of efficiency is due mainly to darkening of the material (creation of color centers), 
resulting in absorption of the emitted radiation, and can sometimes be reversed by 
annealing the crystal (Wiggins, 1978). 

The decay time of a scintillator is of particular importance in electron counting. 
Plastics generally have time constants below 10 ns, allowing pulse counting up to 
at least 20 MHz. However, most scintillators have several time constants, extending 
sometimes up to several seconds. By shifting the effective baseline at the discrim- 
inator circuit, the longer time constants increase the “dead time” between counted 
pulses (Craven and Buggy, 1984). 

P-46 (cerium-doped Y3Al5O12) can be grown as a single crystal (Blasse and Bril, 
1967; Autrata et al., 1983) and combines high quantum efficiency with excellent 
radiation resistance. The light output is in the yellow region of the spectrum, which 
is efficiently detected by most photomultiplier tubes. Electron counting up to a few 
megahertz is possible with this material (Craven and Buggy, 1984). 


2.4.4.4 Photomultiplier Tubes 


A photomultiplier tube contains a photocathode (which emits electrons in response 
to incident photons), several “dynode” electrodes (that accelerate the photoelec- 
trons and increase their number by a process of secondary emission), and an anode 
that collects the amplified electron current so that it can be fed into a preamplifier; 
see Fig. 2.23. To produce photoelectrons from visible photons, the photocathode 
must have a low work function and cesium antimonide is a popular choice, although 
single-crystal semiconductors such as gallium arsenide have also been used. 

The spectral response of a PMT depends on the material of the photocathode, 
its treatment during manufacture, and on the type of glass used in constructing the 
tube. Both sensitivity and spectral response can change with time as gas is liberated 
from internal surfaces and becomes adsorbed on the cathode. Photocathodes whose 
spectral response extends to longer wavelengths tend to have more “dark emission,” 
leading to a higher dark current at the anode and increased output noise. The dark 
current decreases by typically a factor of 10 when the PMT is cooled from room 
temperature to −30°C, but is increased if the cathode is exposed to room light (even 
with no voltages applied to the dynodes) or to strong light from a scintillator and 
can take several hours to return to its original value. 

The dynodes consist of a staggered sequence of electrodes with a secondary electron 
yield of about 4, giving an overall gain of $10^6$ or more if there are 10 electrodes. 
Gallium phosphide has been used for the first dynode, giving a higher secondary 
electron yield, improved signal/noise ratio, and easier discrimination against noise 
pulses in the electron counting mode (Engel et al., 1981). 

The PMT anode is usually operated at ground potential, the photocathode being 
at a negative voltage (typically −700 to −1500 V) and the dynode potentials supplied 
by a chain of low-noise resistors (Fig. 2.23). For analog operation, where the 
anode signal is treated as a continuous current, the PMT acts as an almost ideal cur- 
rent generator, the negative voltage developed at the anode being proportional to the 
load resistor and (within the linear region of operation) to the light input. Linearity 
is said to be within 3% provided the anode current does not exceed one-tenth of that 
flowing through the dynode resistance chain (Land, 1971). The electron gain can be 
controlled over a wide range by varying the voltage applied to the tube. Since the 
gain depends sensitively on this potential, the voltage stability of the power supply 
needs to be an order of magnitude better than the required stability of the output 
current. 
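
The gain figure and the stability requirement can both be illustrated with a short calculation. The sketch assumes identical dynode stages and a per-stage yield that varies as a power of the applied voltage; the default exponent of 1 is an assumption chosen for simplicity (in practice the exponent is often somewhat below unity), and the function names are hypothetical.

```python
def pmt_gain(stage_yield, n_dynodes):
    """Overall electron gain for n identical dynode stages."""
    return stage_yield ** n_dynodes


def fractional_gain_change(fractional_voltage_change, n_dynodes, exponent=1.0):
    """Fractional change in gain for a small fractional change in tube voltage,
    assuming the per-stage yield is proportional to voltage**exponent."""
    return (1.0 + fractional_voltage_change) ** (exponent * n_dynodes) - 1.0
```

Ten stages with a yield of 4 give a gain of 4^10, just over 10^6. Under the stated assumption, a 0.1% change in voltage across a 10-dynode tube changes the gain by about 1%, which is one way to rationalize the order-of-magnitude margin required of the power supply.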

An electron whose energy is 10 keV or more produces some hundreds of photons 
within a typical scintillator. Even allowing for light loss before reaching the photo- 
cathode, the resulting negative pulse at the anode is well above the PMT noise level, 
so energy-loss electrons can be individually counted. The maximum counting fre- 
quency is determined by the decay time of the scintillator, the characteristics of the 
PMT, and the output circuitry. To ensure that the dynode potentials (and secondary 
electron gain) remain constant during the pulse interval, capacitors are placed across 
the final dynode resistors (Fig. 2.23). To maximize the pulse amplitude and avoid 
overlap of output pulses, the anode time constant $R_l C_l$ must be less than the average 
time between output pulses. The capacitance to ground $C_l$ is kept low by locating 
the preamplifier close to the PMT (Craven and Buggy, 1984). 


2.4.5 DQE of a Single-Channel System 


In addition to the energy-loss signal (S), the output of an electron detector contains 
noise (N). The quality of the signal can be expressed in terms of a signal-to-noise 
ratio: SNR = S/N. However, some of this noise is already present within the electron 
beam in the form of shot noise; if the mean number of electrons recorded in a given 
time is n, the actual number recorded under the same conditions follows a Poisson 
distribution whose variance is m = n and whose standard deviation is $\sqrt{m}$, giving 
an inherent signal/noise ratio: $(\mathrm{SNR})_i = n/\sqrt{m} = \sqrt{n}$. The noise performance of a 
detector is represented by its detective quantum efficiency (DQE): 


$$\mathrm{DQE} = [\mathrm{SNR}/(\mathrm{SNR})_i]^2 = (\mathrm{SNR})^2/n \qquad (2.34)$$


For an “ideal” detector that adds no noise to the signal: SNR = $(\mathrm{SNR})_i$, giving 
DQE = 1. In general, DQE is not constant for a particular detector but depends on 
the incident electron intensity (Herrmann, 1984). 
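
Equation (2.34) can be applied directly to measured data. The following sketch, with hypothetical function names, computes DQE from a measured SNR and also estimates the SNR from repeated readings of a nominally constant signal (mean divided by sample standard deviation); it assumes that the mean number n of incident electrons per reading is known independently.

```python
import math
import statistics


def ideal_snr(n_incident):
    """Shot-noise-limited SNR for a mean of n_incident electrons."""
    return math.sqrt(n_incident)


def dqe(snr_measured, n_incident):
    """Detective quantum efficiency from Eq. (2.34): SNR^2 / n."""
    return snr_measured ** 2 / n_incident


def dqe_from_samples(samples, n_incident):
    """Estimate DQE from repeated readings of a constant signal."""
    snr = statistics.mean(samples) / statistics.stdev(samples)
    return dqe(snr, n_incident)
```

For instance, a measured SNR of 80 with 10,000 incident electrons per reading (ideal SNR of 100) corresponds to DQE = 0.64.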

The measured DQE of a scintillator/PMT detector is typically in the range of 0.5 to 
0.9 for incident electron energies between 20 and 100 keV (Pawley, 1974; Baumann 
et al., 1981; Comins and Thirlwall, 1981). One reason for DQE < 1 concerns the 
statistics of photon production within the scintillator and photoelectron generation 
at the PMT photocathode. Assuming Poisson statistics, it can be shown that 
DQE is limited to a value given by 


$$\mathrm{DQE} \le p/(1 + p) \qquad (2.35)$$


where p is the average number of photoelectrons produced for each incident fast 
electron (Browne and Ward, 1982). For optimum noise performance, p is kept high 
by using an efficient scintillator, metallizing its front surface to reduce light losses 
and providing an efficient light path to the PMT. However, Eq. (2.35) shows that the 
DQE is only seriously degraded if p falls below about 10. 
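
One way to see where the limit in Eq. (2.35) comes from is sketched below. The derivation assumes that the number N of fast electrons arriving in a dwell period is Poisson distributed with mean n, that each electron independently yields a Poisson-distributed number of photoelectrons with mean p, and that no other noise source is present; the symbols N, K, and k_j are introduced here for illustration only.

```latex
% N   : number of fast electrons in a dwell period, Poisson with mean n
% k_j : photoelectrons produced by the j-th electron, Poisson with mean p
% K   : total number of photoelectrons recorded
\begin{align}
K &= \sum_{j=1}^{N} k_j, \qquad \langle K \rangle = np \\
\mathrm{Var}(K) &= \langle N \rangle\,\mathrm{Var}(k)
                 + \mathrm{Var}(N)\,\langle k \rangle^{2} = np + np^{2} \\
\mathrm{SNR}^{2} &= \frac{\langle K \rangle^{2}}{\mathrm{Var}(K)} = \frac{np}{1+p} \\
\mathrm{DQE} &= \frac{\mathrm{SNR}^{2}}{n} = \frac{p}{1+p}
\end{align}
```

Any additional noise can only lower this value, so it acts as an upper bound. Numerically, p = 1 gives 0.5, p = 10 gives about 0.91, and p = 100 gives about 0.99, in line with the remark that the DQE is seriously degraded only when p falls below about 10.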

DQE is also reduced as a result of the statistics of electron multiplication within 
the PMT and dark emission from the photocathode. These effects are minimized by 
using a material with a high secondary electron yield (e.g., GaP) for the first dynode 
and by using pulse counting of the output signal to discriminate against the dark 
current. In practice, the pulse-height distributions of the noise and signal pulses 
overlap (Engel et al., 1981), so that even at its correct setting a discriminator rejects 
a fraction f of the signal pulses, reducing the DQE by the factor (1 − f). The overlap 
occurs as a result of a high-energy tail in the noise distribution and because some 
signal pulses (e.g., due to electrons that are backscattered within the scintillator) 
are weaker than the others. In electron counting mode, the discriminator setting 
therefore represents a compromise between loss of signal and increase in detector 
noise, both of which reduce the DQE. 

If the PMT is used in analog mode together with a V/F converter (see 
Section 2.4.6), DQE will be slightly lower than for pulse counting with the discrimi- 
nator operating at its optimum setting. In the low-loss region of the spectrum, lower 
DQE is unimportant since the signal/noise ratio is adequate. If an A/D converter 
is used in conjunction with a filter circuit whose time constant is comparable to the 
dwell time per channel, there is an additional noise component. Besides variation in 
the number of fast electrons that arrive within a given dwell period, the contribution 
of a given electron to the sampled signal depends on its time of arrival (Tull, 1968) 
and the DQE is halved compared to the value obtained using a V/F converter, which 
integrates the charge pulses without the need of an input filter. 

The preceding discussion relates to the DQE of the electron detector alone. When 
this detector is part of a serial-acquisition system, one can define a detective quantum 
efficiency $(\mathrm{DQE})_{\mathrm{syst}}$ for the recording system as a whole, taking n in Eq. (2.34) 
to be the number of electrons analyzed by the spectrometer during the acquisition 
time, rather than the number that passes through the detection slit. At any instant, 
the detector samples only those electrons that pass through the slit (width s) and so, 
evaluated over the entire acquisition, the fraction of analyzed electrons that reach the 
detector is $s/\Delta x$, where $\Delta x$ is the image-plane distance over which the spectrometer 
exit beam is scanned. The overall DQE in serial mode can therefore be written as 


$$(\mathrm{DQE})_{\mathrm{syst}} = (s/\Delta x)\,(\mathrm{DQE})_{\mathrm{detector}} \qquad (2.36)$$


The energy resolution $\Delta E$ in the recorded data cannot be better than s/D, so 
Eq. (2.36) can be rewritten in the form 


$$(\mathrm{DQE})_{\mathrm{syst}} \le (\Delta E/E_{\mathrm{scan}})\,(\mathrm{DQE})_{\mathrm{detector}} \qquad (2.37)$$


where $E_{\mathrm{scan}}$ is the energy width of the recorded data. Typically, $\Delta E$ is in the range 
0.2 to 2 eV while $E_{\mathrm{scan}}$ may be in the range 100 to 5000 eV, so the overall DQE is 
usually below 1%. In a serial detection system, $(\mathrm{DQE})_{\mathrm{syst}}$ can always be improved 
by widening the detection slit, but at the expense of degraded energy resolution. 
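
A brief numerical sketch shows how small the system DQE becomes in serial mode. It evaluates Eq. (2.36) from the slit width and scan distance, and the upper limit of Eq. (2.37) from the energy resolution and scan range; the function names and the example values are illustrative assumptions.

```python
def system_dqe(slit_width, scan_distance, detector_dqe):
    """Eq. (2.36): slit width and scan distance in the same length units."""
    return (slit_width / scan_distance) * detector_dqe


def system_dqe_limit(delta_e_ev, e_scan_ev, detector_dqe):
    """Upper limit of Eq. (2.37) from energy resolution and scan range (eV)."""
    return (delta_e_ev / e_scan_ev) * detector_dqe
```

For a detector DQE of 0.8, an energy resolution of 1 eV, and a 1000-eV scan, the limit is 8 x 10^-4, or 0.08%. The same figure follows from Eq. (2.36) if, for example, the dispersion is 2 µm/eV, so that the slit is 2 µm wide and the scan covers 2000 µm.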


2.4.6 Serial-Mode Signal Processing 


We now discuss methods for converting the output of a serial-mode detector into 
numbers stored in computer memory. The detector is assumed to consist of a 
scintillator and PMT, although similar principles apply to solid-state detectors. 


2.4.6.1 Electron Counting 


Photomultiplier tubes have low noise and high sensitivity; some can even count 
photons. Within a suitable scintillator, a high-energy electron produces a rapid burst 
of light containing many photons, so it is relatively easy to detect and count individ- 
ual electrons, resulting in a one-to-one relationship between the stored counts per 
channel and the number of energy-loss electrons reaching the detector. The intensity 
scale is then linear down to low count rates and the sensitivity of the detector is unaf- 
fected by changes in PMT gain arising from tube aging or power-supply fluctuations 
and should be independent of the incident electron energy. 

In order to extend electron counting down to low arrival rates, a lower level dis- 
criminator is used to eliminate anode signals generated by stray light, low-energy 
x-rays, or noise sources within the PMT (mainly dark emission from the photocathode). 
If the PMT has single-photon sensitivity, dark emission produces discrete 
pulses at the anode, each containing G electrons, where G is the electron gain of the 
dynode chain. To accurately set the discriminator threshold, it is useful to measure 
the distribution of pulse amplitudes at the output of the PMT preamplifier, using an 
instrument with pulse-height analysis (PHA) facilities or a fast oscilloscope. The 
pulse-height distribution should contain a maximum (at zero or low pulse ampli- 
tude) arising from noise and a second maximum that represents signal pulses; the 
discriminator threshold is placed between the peaks (Engel et al., 1981). 
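
Choosing the threshold from a measured pulse-height distribution can be sketched as follows. The function takes a histogram (counts per pulse-height bin), identifies the two tallest local maxima as the noise and signal peaks, and returns the bin with the fewest counts between them. This is only an illustrative procedure with a hypothetical function name; a real distribution would normally need smoothing first, and the optimum setting depends on the overlap discussed in Section 2.4.5.

```python
def threshold_between_peaks(histogram):
    """Return the index of the lowest bin between the two tallest local maxima.

    A bin counts as a local maximum if it exceeds its left neighbour and is
    not exceeded by its right neighbour; the end bins are allowed.
    """
    n = len(histogram)
    peaks = [
        i for i in range(n)
        if (i == 0 or histogram[i] > histogram[i - 1])
        and (i == n - 1 or histogram[i] >= histogram[i + 1])
    ]
    if len(peaks) < 2:
        raise ValueError("fewer than two peaks found in the histogram")
    tallest_two = sorted(peaks, key=lambda i: histogram[i], reverse=True)[:2]
    first, second = sorted(tallest_two)
    return min(range(first, second + 1), key=histogram.__getitem__)
```

For a histogram such as [90, 40, 10, 3, 8, 30, 55, 30, 9, 2], with a noise peak in bin 0 and a signal peak in bin 6, the function returns bin 3.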

A major limitation of pulse counting is that (owing to the distribution of decay 
times of the scintillator) the maximum count rate is only a few megahertz for P-46 
(Ce-YAG) and of the order of 20 MHz for a plastic scintillator (less if the scintillator 
has suffered radiation damage), rates that correspond to an electron current below 
4 pA. Since the incident beam current is typically in the range of 1 nA to 1 μA,
alternative arrangements are usually necessary for recording the low-loss region of 
the spectrum. 
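
The link between count rate and beam current is simply the electron charge multiplied by the arrival rate. The short sketch below checks the figures quoted above; the function names and the 1 nA example current are chosen here for illustration.

```python
ELEMENTARY_CHARGE = 1.602176634e-19  # coulombs per electron

def count_rate_to_current(rate_hz):
    """Current (A) carried by electrons arriving at rate_hz per second."""
    return rate_hz * ELEMENTARY_CHARGE

def current_to_count_rate(current_a):
    """Electron arrival rate (per second) for a current in amperes."""
    return current_a / ELEMENTARY_CHARGE

# 20 MHz counting limit of a plastic scintillator, expressed in picoamperes.
plastic_limit_pa = count_rate_to_current(20e6) * 1e12   # about 3.2 pA
# Arrival rate for an illustrative 1 nA beam.
rate_at_1na = current_to_count_rate(1e-9)                # about 6.2e9 electrons/s
```

A 1 nA beam therefore delivers electrons several hundred times faster than the fastest scintillator can count them individually, which is why the low-loss region usually needs analog recording.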


2.4.6.2 Analog/Digital Conversion 


At high incident rates, the charge pulses produced at the anode of a PMT merge 
and the preamplifier output becomes a continuous current or voltage, whose level is 
related to the electron flux falling on the scintillator. There are two ways of convert- 
ing this voltage into binary form for digital storage. One is to feed the preamplifier 
output into a voltage-to-frequency (V/F) converter (Maher et al., 1978; Zubin and 
Wiggins, 1980). This is essentially a voltage-controlled oscillator; its output con- 
sists of a continuous train of pulses that can be counted using the same scaling 
circuitry as employed for electron counting. The output frequency is proportional to 
the input voltage between (typically) 10 μV and 10 V, providing excellent linearity
over a large dynamic range. V/F converters are available with output rates as high as 
20 MHz. Unfortunately the output frequency is slightly temperature dependent, but 
this drift can be accommodated by providing a “zero-level” frequency-offset con-
trol that is adjusted from time to time to keep the output rate down to a few counts
per channel with the electron beam turned off. The minimum output rate should not 
fall to zero, since this condition could change to a lack of response at low electron 
intensity, resulting in recorded spectra whose channel contents vanish at some value 
of the energy loss (Joy and Maher, 1980a). Any remaining background within each 
spectrum (measured, for example, to the left of the zero-loss peak) is subtracted
digitally in computer memory. 

An alternative method of digitizing the analog output of a PMT is via an analog- 
to-digital (A/D) converter. The main disadvantage is limited dynamic range and the 
fact that, whereas the V/F converter effectively integrates the detector output over 
the dwell time per data channel, an A/D converter may sample the voltage level only 
once per channel. To eliminate contributions from high-frequency noise, the PMT 
output must therefore be smoothed with a time constant approximately equal to the 
dwell period per channel (Egerton and Kenway, 1979; Zubin and Wiggins, 1980), 
which requires resetting the filter circuit each time the dwell period is changed. 
Even if this is done, the smoothing introduces some smearing of the data between 
adjacent channels. The situation is improved by sampling the data many times per 
channel and taking an average. 
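
The benefit of sampling many times per channel can be illustrated with a small simulation. It assumes white (sample-to-sample uncorrelated) noise on a constant input level, which is a simplification; the function name and all numerical values are hypothetical.

```python
import random
import statistics

def digitize(level, noise_rms, samples_per_channel, n_channels, rng):
    """Return one averaged A/D reading per channel for a constant input level."""
    readings = []
    for _ in range(n_channels):
        samples = [level + rng.gauss(0.0, noise_rms)
                   for _ in range(samples_per_channel)]
        readings.append(sum(samples) / samples_per_channel)
    return readings

rng = random.Random(0)
single = digitize(1.0, 0.1, 1, 2000, rng)      # one sample per channel
averaged = digitize(1.0, 0.1, 16, 2000, rng)   # sixteen samples per channel
# For uncorrelated noise the ratio should be close to sqrt(16) = 4.
noise_ratio = statistics.pstdev(single) / statistics.pstdev(averaged)
```

Because the average is formed digitally within each channel, the noise reduction does not smear data between adjacent channels in the way an analog smoothing filter does.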


2.5 Parallel Recording of Energy-Loss Data 


A parallel-recording system utilizes a position-sensitive detector that is exposed to 
a broad range of energy loss. Because there is no energy-selecting slit, the detective 
quantum efficiency (DQE) of the recording system is the same as that of the detector, 
rather than being limited by Eq. (2.36). As a result, spectra can be recorded in 
shorter times and with less radiation dose than with serial acquisition, for the same 
noise content. These advantages are of particular importance for the spectroscopy 
of ionization edges at high energy loss, where the electron intensity is low. 

Photographic film was the earliest parallel-recording medium. With suitable 
emulsion thickness, the DQE exceeds 0.5 over a limited exposure range (Herrmann, 
1984; Zeitler, 1992). Its disadvantages are a limited dynamic range, the need for 
chemical processing, and the fact that densitometry is required to obtain quantitative 
data. 

Modern systems utilize a silicon diode array, such as a photodiode array (PDA) 
or charge-coupled diode (CCD) array. These two types differ in their internal mode 
of operation, but both provide a pulse-train output that can be fed to an electronic 
data storage system, just as in serial acquisition. A one-dimensional (linear) PDA 
was used in the original Gatan PEELS spectrometer but subsequent models use two- 
dimensional (area) CCD arrays. Appropriate readout circuitry allows the same array 
to be used for recording spectra, TEM images, and diffraction patterns. 


2.5.1 Types of Self-Scanning Diode Array 


The first parallel-recording detectors to be used with energy-loss spectrometers were 
linear photodiode arrays, containing typically 1024 silicon diodes. They were later 
replaced by two-dimensional charge-coupled diode (CCD) arrays, which are also 
used in astronomy and other optics applications. In the CCD, each diode is initially 
charged to the same potential and this charge is depleted by electrons and holes 
created by the incident photons, in proportion to the local photon intensity. To read
out the remaining charge on each diode, charge packets are moved into a trans- 
fer buffer and then to an output electrode (Zaluzec and Strauss, 1989; Berger and 
McMullan, 1989), during which time the electron beam is usually blanked. 

The slow-scan arrays used for EELS or TEM recording differ somewhat from 
those found in TV-rate video cameras. Their pixels are larger, allowing more charge 
to be stored per pixel and giving a larger dynamic range. The frame-transfer buffer 
can be eliminated, allowing almost the entire area of the chip to be used for image
recording and therefore a larger number of pixels (at least 1k × 1k is common).
Finally, they are designed to operate below room temperature (e.g., −20°C) so that
dark current and readout noise are reduced, which also improves the sensitivity and 
dynamic range. 

The Gatan Enfina spectrometer, frequently used with STEM systems, employs 
a rectangular CCD array (1340 × 100 pixels). The number of pixels in the nondis-
persive direction, which are summed (binned) during readout, can be chosen to suit
the required detector sensitivity, readout time, and dynamic range. Summing all 
100 pixels provides the highest sensitivity. The Gatan imaging filter (Section 2.6.1) 
contains a square CCD array fiber optically coupled to a thin scintillator. This equip- 
ment can be used to record either energy-loss spectra or energy-filtered images or 
diffraction patterns, depending on the settings of the preceding quadrupole lenses. 


2.5.2 Indirect Exposure Systems 


Diode arrays are designed as light-optical sensors and are used as such, together 
with a conversion screen (imaging scintillator), in an indirect exposure system. 
Figure 2.24 shows the general design of a system that employs a thin scintillator,
coupled by fiber-optic plate to a thermoelectrically cooled diode array. To provide
sufficient energy resolution to examine fine structure in a loss spectrum, the spec-
trometer dispersion is increased by multipole lenses. The main components of an
indirect exposure system will now be discussed in sequence.

Fig. 2.24 Schematic diagram of a parallel-recording energy-loss spectrometer, courtesy of
O. Krivanek


2.5.2.1 Magnifying the Dispersion 


The spatial resolution (interdiode spacing) of a typical CCD is 14 μm, while the
energy dispersion of a compact magnetic spectrometer is only a few micrometers 
per electron volt. To achieve a resolution of 1 eV or better, it is therefore necessary 
to magnify the spectrum onto the detector plane. A round lens can be used for this 
purpose (Johnson et al., 1981b) but it introduces a magnification-dependent rotation 
of the spectrum unless compensated by a second lens (Shuman and Kruit, 1985). 
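
The need for magnification follows from simple arithmetic: the energy width of one detector channel is the diode spacing divided by the (magnified) dispersion. The sketch below uses the 14 μm spacing quoted above and assumes, purely for illustration, a native dispersion of 2 μm/eV and a target of 0.25 eV per channel; the function names are likewise hypothetical.

```python
def ev_per_channel(pitch_um, dispersion_um_per_ev, magnification=1.0):
    """Energy width (eV) spanned by one detector element."""
    return pitch_um / (dispersion_um_per_ev * magnification)

def required_magnification(pitch_um, dispersion_um_per_ev, target_ev_per_channel):
    """Magnification of the spectrum needed to reach a target eV per channel."""
    return pitch_um / (dispersion_um_per_ev * target_ev_per_channel)

# Assumed native dispersion of 2 um/eV (the text says only "a few").
unmagnified = ev_per_channel(14.0, 2.0)            # 7 eV per channel
needed = required_magnification(14.0, 2.0, 0.25)   # magnification of 28
```

With these assumed numbers, an unmagnified spectrum would place about 7 eV on each diode, far coarser than the 1 eV resolution mentioned above.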

A magnetic quadrupole lens provides efficient and rotation-free focusing in the 
vertical (dispersion) plane but does not focus in the horizontal direction, giving a 
line spectrum. In fact, a line spectrum is preferable because it involves lower current 
density and less risk of radiation damage to the scintillator. The simplest system 
consists of a single quadrupole (Egerton and Crozier, 1987) but using several allows 
the horizontal width to be controlled. Other quadrupole designs (Scott and Craven, 
1989; Stobbs and Boothroyd, 1991; McMullan et al., 1992) allow the spectrometer 
to form a crossover at which an energy-selecting slit can be introduced in order to
perform energy-filtered imaging in STEM or fixed-beam mode. 

Gatan spectrometers use several multipoles, allowing the final dispersion to be 
varied and (in the GIF system) an energy-filtered image or diffraction pattern to be 
projected onto the CCD array if an energy-selecting slit is inserted. The image is 
formed in CTEM mode, without raster scanning of the specimen, using the two- 
dimensional imaging properties of a magnetic prism (see Section 2.1.2). However, 
this operation requires corrections for image distortion and aberrations, hence the 
need for a complicated system of multipole lenses (Fig. 2.25), made possible by 
computer control of the currents in the individual multipoles. As seen in Fig. 2.26,
there is a crossover in both x- and y-directions at the energy-selecting slit (double-
focusing condition) and the dispersion is zero at the final image plane (achromatic
image of the specimen) or its diffraction pattern.

Fig. 2.25 Cross section of a Gatan GIF Tridiem energy-filtering spectrometer (model 863), show-
ing the variable entrance aperture, magnetic prism with window-frame coils, multipole lens system,
and CCD camera. Courtesy of Gatan

Fig. 2.26 GIF Quantum electron optics, from specimen image inside the magnetic prism (z = 0) to
final image on the CCD camera (z = 0.68 m). Vertical bars indicate individual multipole elements
and position of the energy-selecting slit. From Gubbens et al. (2010), copyright Elsevier


2.5.2.2 Conversion Screen 


The fluorescent screen used in a parallel-recording system performs the same func- 
tion as the scintillator in a serial-recording system, with similar requirements in 
terms of sensitivity and radiation resistance, but since it is an imaging component, 
spatial resolution and uniformity are also important. Good resolution is achieved 
by making the scintillator thin, which reduces lateral spreading of the incident 
electron beam. Resolution is specified in terms of a point-spread function (PSF), 
this being the response of the detector to an electron beam of small diameter (less 
than the interdiode spacing). The modulation transfer function (MTF) is the Fourier 
transform of the PSF and represents the response of the scintillator to sinusoidally 
varying illumination of different spatial frequencies. 
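
To make the PSF/MTF relationship concrete, the sketch below computes the modulus of the discrete Fourier transform of two model point-spread functions, one with a narrow core only and one with added long-range tails. The PSF shapes (a Gaussian core of 1-pixel standard deviation and exponential tails carrying 30% of the response) are invented for illustration and do not represent any measured detector.

```python
import cmath
import math

def mtf_from_psf(psf):
    """Modulus of the DFT of psf, normalized to 1 at zero spatial frequency."""
    n = len(psf)
    total = sum(psf)
    mtf = []
    for k in range(n // 2 + 1):
        component = sum(p * cmath.exp(-2j * math.pi * k * x / n)
                        for x, p in enumerate(psf))
        mtf.append(abs(component) / total)
    return mtf

n = 256
center = n // 2
core = [math.exp(-0.5 * (x - center) ** 2) for x in range(n)]
tails = [math.exp(-abs(x - center) / 20.0) for x in range(n)]
core_sum, tail_sum = sum(core), sum(tails)
# Hypothetical split: 70% of the response in the core, 30% in the tails.
psf_core_only = [c / core_sum for c in core]
psf_with_tails = [0.7 * c / core_sum + 0.3 * t / tail_sum
                  for c, t in zip(core, tails)]

mtf_core = mtf_from_psf(psf_core_only)
mtf_tails = mtf_from_psf(psf_with_tails)
```

In this model the tails make the MTF fall quickly at low spatial frequency to a shoulder near the core fraction (about 0.7 here), after which it follows the slower decline set by the core width.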

Uniformity is most easily achieved by use of an amorphous material, such as NE
102 plastic, or a single crystal such as CaF2, NaI, or Ce-doped yttrium aluminum
garnet (YAG). Since organic materials and halides suffer radiation damage under 
a focused electron beam, YAG has been a common conversion screen material in 
parallel-recording spectrometers (Krivanek et al., 1987; Strauss et al., 1987; Batson, 
1988; Yoshida et al., 1991; McMullan et al., 1992) and CCD camera electron- 
imaging systems (Daberkow et al., 1991; Ishizuka, 1993; Krivanek and Mooney, 
1993). 

Single-crystal YAG is uniform in its light-emitting properties, emits yellow light 
to which silicon diode arrays are highly sensitive, and is relatively resistant to radi-
ation damage (see Table 2.2). It can be thinned to below 50 μm and polished by
standard petrographic techniques. The YAG can be bonded directly to a fiber-optic 
plate, using a material of high refractive index to ensure good transmission of light 
in the forward direction. Even so, some light is multiply reflected between the two 
surfaces of the YAG and may travel some distance before entering the array, giv-
ing rise to extended tails on the point-spread function; see Fig. 2.27a. These tails are
reduced by incorporating an antireflection coating between the front face of the YAG
and its conducting coating. Long-range tails on the PSF are indicated by the low-
frequency behavior of the MTF; see Fig. 2.27b. Many measurements of MTF and
DQE have been made on CCD-imaging systems (Kujiwa and Krahl, 1992; Krivanek
and Mooney, 1993; Zuo, 2000; Meyer and Kirkland, 2000; Thust, 2009; McMullan
et al., 2009a; Riegler and Kothleitner, 2010). The noise properties and DQE of a
parallel-recording system are discussed further in Section 2.5.4.

Fig. 2.27 (a) Point-spread function for a photodiode array, showing the narrow central peak and
extended tails. (b) Modulation transfer function, evaluated as the square root of the PSF power
spectrum; the rapid fall to the shoulder S arises from the PSF tails. From Egerton et al. (1993),
copyright Elsevier

Powder phosphors can be more efficient than YAG, and light scattering at grain 
boundaries reduces the multiple internal reflection that gives rise to the tails on the 
point-spread function. Variations in light output between individual grains add to 
the fixed-pattern noise of the detection system (Daberkow et al., 1991), which is 
usually removed by computer processing. 

Backscattering of electrons from the conversion screen may reduce the DQE of 
the system and subsequent scattering of these electrons back to the scintillator adds 
to the spectrometer background (Section 2.4.2). 


2.5.2.3 Light Coupling to the Array 


A convenient means of transferring the conversion-screen image to the diode array is 
by imaging (coherent) fiberoptics. The resulting optical system requires no focusing, 
has a good efficiency of light transmission, has no field aberrations (e.g., distortion), 
and is compact and rigid, minimizing the sensitivity to mechanical vibration. The 
fiber-optic plate can be bonded with transparent adhesive to the scintillator and with 
silicone oil to the diode array; minimizing the differences in refractive index reduces 
light loss by internal reflection at each interface. 

Fiber-optic coupling is less satisfactory for electrons of higher energy
(>200 keV), some of which penetrate the scintillator and cause radiation damage 
(darkening) of the fibers or generate x-rays that could damage a nearby diode array. 
Some electrons are backscattered from the fiber plate, causing light emission into 
adjacent diodes and thereby augmenting the tails on the response function (Gubbens 
et al., 1991). These problems are avoidable by using a self-supporting scintillator 
and glass lenses to transfer the image from the scintillator to the array. Lens optics 
allows the sensitivity of the detector to be varied (by means of an aperture stop) and 
makes it easier to introduce magnification or demagnification, so that the resolu- 
tion of the conversion screen and the detector can be matched in order to optimize 
the energy resolution and DQE (Batson, 1988). However, the light coupling is less 
efficient, resulting in decreased noise performance of the system. 


2.5.3 Direct Exposure Systems 


Although diode arrays are designed to detect visible photons, they also respond 
to charged particles such as electrons. A single 100-keV electron generates about 
27,000 electron-hole pairs in silicon, well above CCD readout noise, allowing a 
directly exposed array to achieve high DQE at low electron intensities. At very low 
intensity (less than one electron/diode within the integration period) there is the 
possibility of operation in an electron counting mode. 

This high sensitivity can be a disadvantage, since the saturation charge of even 
a large-aperture photodiode array is equivalent to only a few hundred directly inci-
dent electrons, giving a dynamic range of $\approx 10^{2}$ for a single readout. However, the
sensitivity can be reduced by shortening the integration time and accumulating a 
large number of readouts, thereby increasing the dynamic range (Egerton, 1984). 
But to record the entire spectrum with a reasonable incident beam current (>1 pA), 
some form of dual system is needed, either using serial recording to record the low- 
loss region (Bourdillon and Stobbs, 1986) or using fast beam switching and a dual 
integration time on a CCD array (Gubbens et al., 2010). 

Direct exposure involves some risk of radiation damage to the diode array. To 
prevent rapid damage to field-effect transistors located along the edge of a photodi- 
ode array, Jones et al. (1982) masked this area from the beam. Even then, radiation 
damage can cause a gradual increase in dark current, resulting in increased diode 
shot noise and reduced dynamic range (Shuman, 1981). The damage mechanism is 
believed to involve creation of electron-hole pairs within the SiO2 passivating layer
covering the diodes (Snow et al., 1967) and has been reported to be higher at 20-keV 
incident energy compared to 100 keV (Roberts et al., 1982). When bias voltages are 
removed, the device may recover, especially if the electron beam is left on (Egerton 
and Cheng, 1982). 

Since the dark current diminishes with decreasing temperature, cooling the array 
reduces the symptoms of electron-beam damage. Measurements on a photodiode 
array cooled to −30°C suggested an operating lifetime of at least 1000 h (Egerton
and Cheng, 1982). Jones et al. (1982) reported no observable degradation for an
array kept at −40°C, provided the current density was below 10 μA/m². Diode
arrays can be operated at temperatures as low as −150°C for low-level photon
detection (Vogt et al., 1978). 

Irradiation of a chemically thinned device from its back surface was proposed 
long ago as a way of avoiding oxide-charge buildup (Imura et al., 1971; Hier 
et al., 1979). More recently, electron counting and event-processed modes are being 
explored in connection with direct recording of low-intensity electron images on 
thinned CMOS devices (Vos et al., 2009; McMullan et al., 2009b) so there appears 
to be renewed hope for the direct-recording concept. 


2.5.4 DQE of a Parallel-Recording System 


Detective quantum efficiency (DQE) is a measure of the quality of a recording sys- 
tem: how little noise it adds to the electron image. In the case of a parallel-recording 
system, DQE is usually taken to be a function of spatial frequency in the recorded 
image. Here we present a simplified analysis in which DQE is represented as a sin- 
gle number, together with an interchannel mixing parameter that accounts for the 
width of the point-spread function (PSF). 

Consider a one- or two-dimensional array that is uniformly irradiated by fast elec- 
trons, N being the average number recorded by each element during the integration 
period. Random fluctuations (electron-beam shot noise) contribute a root-mean-
square (rms) variation between channels of magnitude $N^{1/2}$, according to Poisson
statistics. The PSF of the detector may be wider than the interdiode spacing, so elec-
trons arriving at a given location are spread over several diode channels, reducing
the recorded electron-beam shot noise to


$$N_b = N^{1/2}/s \qquad (2.38)$$


where s is a smoothing (or mixing) factor that can be determined experimentally 
(Yoshida et al., 1991; Ishizuka, 1993; Egerton et al., 1993). It represents degradation 
of the spatial or energy resolution resulting from light spreading in the scintillator 
and any interchannel coupling in the array. 

In the case of indirect recording, $N$ fast electrons generate (on average) $Np$ pho-
tons in the scintillator. If all electrons followed the same path, the statistical variation
in the number of photons produced would be $(Np)^{1/2}$, but in practice, each electron
behaves differently. For example, some penetrate only a short distance before being
backscattered and produce significantly fewer photons. Allowing for channel mix-
ing and dividing by $p$ so that the photon noise component $N_p$ is expressed in units
of fast electrons, we must therefore write


$$N_p = s^{-1} N^{1/2} (\sigma_p/p) \qquad (2.39)$$


where $\sigma_p$ is the actual root-mean-square (rms) variation in light output. Monte Carlo
simulations and measurements of the height distribution of photon pulses have given
$\sigma_p/p \approx 0.31$ for a YAG scintillator that is thick enough to absorb the incident
electrons, and $\sigma_p/p \approx 0.59$ for a 50-μm YAG scintillator exposed to 200-keV
electrons (Daberkow et al., 1991).

Each photon produces an electron-hole pair in the diode array, but random 
variation (shot noise) in the diode leakage current and electronic noise (whose 
components may include switching noise, noise on supply and ground lines, video- 
amplifier noise, and digitization error of an A/D converter) add a total readout 
noise $N_r$, expressed here in terms of fast electrons. It is also possible to include
a fixed-pattern term $N_v = vN$, representing the rms fractional variation $v$ in gain between
individual diode channels, which arises from differences in sensitivity between indi-
vidual diodes, nonuniformities of the optical coupling, and variations in sensitivity
of the scintillator. Adding all noise components in quadrature, the total noise $N_t$ is
given by


$$N_t^2 = N_b^2 + N_p^2 + N_r^2 + N_v^2 \qquad (2.40)$$


From Eqs. (2.38), (2.39), and (2.40), the signal/noise ratio (SNR) of the diode array
output is


$$\mathrm{SNR} = N/N_t = N (N/s^2 + N \sigma_p^2 p^{-2} s^{-2} + N_r^2 + v^2 N^2)^{-1/2} \qquad (2.41)$$


SNR increases with signal level N, tending asymptotically to 1/v. As an illustration, 
measurements on a PEELS detector based on a Hamamatsu S2304 photodiode array
gave $N_r = 60$ and a limiting SNR of 440, implying $v$ = 0.23% (Egerton et al.,
1993). The shape of the SNR versus $N$ curve could be fitted with $s = 5$, as shown
by the solid curve in Fig. 2.28a.

Fig. 2.28 (a) SNR and (b) DQE for a PEELS detector, as a function of signal level (up to sat-
uration). The experimental data are for three different integration times and are matched by Eqs.
(2.41) and (2.44) with s = 5. The parameter m represents the number of readouts. From Egerton
et al. (1993), copyright Elsevier

As in the case of a single-channel system, the quality of the detector is
represented by its DQE, which can be defined as 


$$\mathrm{DQE} = [(\mathrm{SNR})/(\mathrm{SNR})_i]^2 \qquad (2.42)$$


According to this definition, $(\mathrm{SNR})_i$ is the signal/noise ratio of an ideal detector that
has the same energy resolution (similar PSF) but with $N_p = N_r = N_v = 0$, so that


$$(\mathrm{SNR})_i = N/N_b = s N^{1/2} \qquad (2.43)$$


Making use of Eqs. (2.41), (2.42), and (2.43), the detective quantum efficiency is


$$\mathrm{DQE} = s^{-2} (\mathrm{SNR})^2/N = (1 + \sigma_p^2/p^2 + s^2 N_r^2/N + v^2 s^2 N)^{-1} \qquad (2.44)$$


Electron-beam shot noise is represented by the first term in parentheses in 
Eq. (2.44); the other terms cause DQE to be less than 1. For low N, the third term 
becomes large and DQE is reduced by readout noise. At large N, the fourth term 
predominates and DQE is reduced as a result of gain variations. Between these 
extremes, DQE reaches a maximum, as illustrated in Fig. 2.28b. Note that if the 
smoothing effect of the PSF is ignored, the apparent DQE, if defined as $(\mathrm{SNR})^2/N$,
can exceed unity. 

Provided the gain variations are reproducible, they constitute a fixed pattern 
that can be removed by signal processing (Section 2.5.5), making the last term 
in Eq. (2.44) zero. The detective quantum efficiency then increases to a limiting 
value at large N, equal to $(1 + \sigma_p^2/p^2)^{-1} \approx 0.9$ for a 50-μm YAG scintillator
(thick enough to absorb 100-keV electrons), and this high DQE allows the detec- 
tion of small fractional changes in electron intensity, corresponding to elemental 
concentrations below 0.1 at.% (Leapman and Newbury, 1993). 
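
Equations (2.41) and (2.44) are easy to evaluate numerically. The sketch below does so using the parameter values quoted in this section (s = 5, a readout noise of 60, v = 0.23%, and a relative light-output variation of 0.31); the function and argument names are introduced here for illustration only.

```python
import math

def snr(n, s, sigma_over_p, n_r, v):
    """Eq. (2.41): signal/noise ratio for n fast electrons per channel."""
    variance = (n / s**2 + n * sigma_over_p**2 / s**2
                + n_r**2 + (v * n) ** 2)
    return n / math.sqrt(variance)

def dqe(n, s, sigma_over_p, n_r, v):
    """Eq. (2.44): detective quantum efficiency."""
    return 1.0 / (1.0 + sigma_over_p**2 + s**2 * n_r**2 / n + v**2 * s**2 * n)

S, SIGMA_OVER_P, N_R, V = 5.0, 0.31, 60.0, 0.0023

# At very high signal the SNR approaches 1/v.
limiting_snr = snr(1e12, S, SIGMA_OVER_P, N_R, V)
# With the fixed-pattern term removed (v = 0), DQE approaches 1/(1 + 0.31**2).
limiting_dqe = dqe(1e12, S, SIGMA_OVER_P, N_R, 0.0)
# The two expressions for DQE in Eq. (2.44) agree at any signal level.
n_test = 2.0e4
consistency = dqe(n_test, S, SIGMA_OVER_P, N_R, V) - snr(n_test, S, SIGMA_OVER_P, N_R, V) ** 2 / (S**2 * n_test)
```

Scanning `n` over several decades with these functions reproduces the qualitative behavior described above: DQE is low at small signal (readout noise), low again at large signal (gain variations), and peaks in between.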


2.5.4.1 Multiple Readouts 


If a given recording time T is split into m periods, by accumulating m readouts in 
computer memory, the electron-beam and diode array shot noise components are in 
principle unaltered, since they depend only on the total integration time. The noise 
due to gain variations, which depends on the total recorded signal, should also be 
the same. But if readout noise (exclusive of diode shot noise) adds independently 
at each readout, its total is augmented by a factor of $m^{1/2}$, increasing the $N_r^2$ term
in Eq. (2.44) by a factor of m. As a result, the DQE will be lower, as shown for a 
photodiode array in Fig. 2.28b. 

Modern CCD arrays have a low readout noise: 50 diode electrons or less for 
the array used in a GIF Quantum spectrometer (Gubbens et al., 2010). However, 
an array cannot be read out instantaneously; a readout time of 115 ms has been 
quoted for an Enfina spectrometer (Bosman and Keast, 2008). During this time, 
the beam in a Gatan system is usually blanked but the specimen is still irradiated, 
possibly undergoing radiation damage. So for a given specimen irradiation time, 
increasing the number of readouts reduces the spectrum-recording time, lowering the
SNR and DQE. If beam blanking is not applied, gain variations between different 
spectral channels are increased; however, the situation can be improved by binned 
gain averaging (Bosman and Keast, 2008); see Section 2.5.5. 

One advantage of multiple readouts is increased dynamic range. The minimum 
signal that can be detected is $S_{\min} = F N_t$, where $N_t$ is the total noise and $F$ is
the minimum acceptable signal/noise ratio, often taken as 5 (Rose, 1970). Since
some of the noise components are not increased by multiple readouts, the total noise
increases by a factor less than $m^{1/2}$. The maximum signal that can be recorded in $m$
readouts is


$$S_{\max} = m(Q_{\mathrm{sat}} - I_d T/m) = m Q_{\mathrm{sat}} - I_d T \qquad (2.45)$$


where $Q_{\mathrm{sat}}$ is the diode-saturation charge and $I_d$ is the diode thermal-leakage current.
$S_{\max}$ is increased by more than a factor of $m$ compared to the largest signal
($Q_{\mathrm{sat}} - I_d T$) that can be recorded in the same time with a single readout. Therefore the
dynamic range $S_{\max}/S_{\min}$ of the detector is increased by more than a factor of $m^{1/2}$ by
use of multiple readouts. A further advantage of multiple readouts is that spectrum 
drift can be compensated inside the data-recording system. 

The choice of m is therefore a compromise. To obtain adequate dynamic range 
with the least penalty in terms of readout noise, the integration time per read- 
out should be adjusted so that the array output almost saturates at each readout. 
This procedure minimizes the number of readouts in a given specimen irradiation 
time, optimizing the signal/noise ratio and minimizing any radiation damage to the 
specimen. 
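
The trade-off can be explored with a small numerical sketch of the dynamic range for m readouts. It assumes that the noise floor consists only of leakage-current shot noise (fixed by the total time) plus readout noise added independently at each readout, and it expresses all charges in electrons. The saturation charge, leakage charge, and readout noise used below are hypothetical values, not data for any particular array.

```python
import math

def dynamic_range(m, q_sat, leak_total, n_readout, f_min=5.0):
    """S_max/S_min when a fixed recording time is split into m readouts.

    q_sat       saturation charge per readout (electrons)
    leak_total  leakage charge I_d*T collected over the whole time (electrons)
    n_readout   rms readout noise added at each readout (electrons)
    f_min       minimum acceptable signal/noise ratio F
    """
    s_max = m * q_sat - leak_total                       # Eq. (2.45)
    # Assumed noise floor: Poisson noise of the leakage charge plus m readouts.
    n_total = math.sqrt(leak_total + m * n_readout ** 2)
    s_min = f_min * n_total
    return s_max / s_min

single = dynamic_range(1, 1.0e6, 2.0e5, 50.0)
multiple = dynamic_range(16, 1.0e6, 2.0e5, 50.0)
improvement = multiple / single   # exceeds sqrt(16) = 4 for these values
```

For these assumed numbers the gain in dynamic range is well above the square root of m, because the leakage shot noise, which dominates the noise floor here, does not grow with the number of readouts.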


2.5.5 Dealing with Diode Array Artifacts 


The extended tails of the detector point-spread function distort all spectral features 
recorded with a diode array. Since the tails contain only low spatial frequencies, they 
can be largely removed by Fourier ratio deconvolution, taking the zero-loss peak as 
representing the detector PSF (Mooney et al., 1993). To avoid noise amplification, 
the central (Gaussian) portion of the zero-loss peak can be used as a reconvolution 
function; see Section 4.1.2. 
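
A minimal numerical sketch of this procedure is given below. It uses a naive discrete Fourier transform on a short, noise-free, circularly wrapped spectrum, with an invented PSF (Gaussian core plus exponential tails) standing in for the recorded zero-loss peak; none of the names or values come from the cited work.

```python
import cmath
import math

def dft(values, inverse=False):
    """Naive discrete Fourier transform (adequate for short arrays)."""
    n = len(values)
    sign = 1.0 if inverse else -1.0
    result = []
    for k in range(n):
        total = sum(values[j] * cmath.exp(sign * 2j * math.pi * j * k / n)
                    for j in range(n))
        result.append(total / n if inverse else total)
    return result

def circular_convolve(a, b):
    product = [x * y for x, y in zip(dft(a), dft(b))]
    return [value.real for value in dft(product, inverse=True)]

def fourier_ratio_deconvolve(recorded, zero_loss, reconvolution):
    """Divide by the zero-loss transform, multiply by the reconvolution transform."""
    ratio = [r / z * g for r, z, g in
             zip(dft(recorded), dft(zero_loss), dft(reconvolution))]
    return [value.real for value in dft(ratio, inverse=True)]

n = 128
distance = [min(x, n - x) for x in range(n)]            # circular distance from channel 0
core = [math.exp(-0.5 * (d / 1.5) ** 2) for d in distance]
tails = [math.exp(-d / 15.0) for d in distance]
core_sum, tail_sum = sum(core), sum(tails)
core = [c / core_sum for c in core]                      # Gaussian reconvolution function
psf = [0.7 * c + 0.3 * t / tail_sum for c, t in zip(core, tails)]

true_spectrum = [0.0] * n
true_spectrum[40] = 1000.0                               # a single sharp feature
recorded = circular_convolve(true_spectrum, psf)
restored = fourier_ratio_deconvolve(recorded, psf, core)
expected = circular_convolve(true_spectrum, core)        # feature broadened by the core only
```

In this idealized case the restored spectrum equals the true spectrum broadened by the Gaussian core alone, so the tails are removed without dividing by very small Fourier coefficients. Real data contain noise, which this sketch does not model.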

Some kinds of diode array suffer from incomplete discharge: each readout con- 
tains a partial memory of previous ones. Under these conditions, it is advisable to 
discard several readouts if acquisition conditions, such as the beam current or the 
region of specimen analyzed, are suddenly changed. Longer term memory effects 
arise from electrostatic charging within the scintillator, resulting in a local increase 
in sensitivity in regions of high electron intensity. This condition is alleviated by 
prolonged exposure to a broad, undispersed electron beam. 

The thermal leakage current varies slightly between individual diodes, so the dark 
current background is not quite constant across a spectrum. This effect is usually 
removed by recording the array output with the electron beam blanked off. Even so,
artifacts remain because the sensitivity of each diode varies slightly and (for indirect
recording) the phosphor screen may be nonuniform in its response. The result is a
correlated or fixed-pattern noise, the same in each readout, which may considerably
exceed the electron-beam shot noise; see Fig. 2.29a.

Fig. 2.29 (a) Summations of 170 spectra from two areas of the field of view (A and B) with no
specimen present, showing the high degree of correlation of the noise. (b) Result of gain averaging
2025 spectra with different amounts n of spectral shift. (c) Boron nitride K-edge spectra acquired
with no CCD binning and a single gain reference or a separate gain reference for each readout,
compared with the result of binned gain averaging (lower curve). From Bosman and Keast (2008),
copyright Elsevier


2.5.5.1 Gain Normalization 


Variations in sensitivity (gain) across the array are most conveniently dealt with 
by dividing each spectral readout by the response of the array when illuminated 
by a broad undispersed beam. This gain normalization or flat-fielding procedure 
can achieve a precision of better than 1% for two-dimensional (CCD) detectors 
(Krivanek et al., 1995) and it therefore works well enough in many cases. However, 
other methods, originally developed to correct for gain variations in linear photodi- 
ode arrays, may be necessary when weak fine-structural features must be extracted 
from EELS data. 
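
Gain normalization amounts to a channel-by-channel division. The sketch below assumes that dark current has already been subtracted from both the spectrum and the flat-field reference; the function name and the four-channel example values are hypothetical.

```python
def gain_normalize(readout, flat_field):
    """Divide a dark-corrected readout by the normalized flat-field response."""
    mean_flat = sum(flat_field) / len(flat_field)
    return [r * mean_flat / f for r, f in zip(readout, flat_field)]

# Hypothetical four-channel example with gains averaging 1.0.
gains = [1.00, 1.02, 0.97, 1.01]
true_signal = [100.0, 200.0, 150.0, 120.0]
readout = [t * g for t, g in zip(true_signal, gains)]
flat_field = [5000.0 * g for g in gains]     # response to uniform illumination
corrected = gain_normalize(readout, flat_field)
```

Scaling by the mean of the flat field keeps the corrected spectrum in the same units as the raw readout. In practice the precision is limited by noise in the flat-field reference and by any change in gain between the two acquisitions.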


2.5.5.2 Gain Averaging 


First applied to low-energy HREELS (Hicks et al., 1980), this method of eliminating 
the effect of sensitivity variations involves scanning a spectrum along the array in 
one-channel increments and performing a readout at each scan voltage. The result- 
ing spectra are electronically shifted into register before being added together in 
data memory. Within the range of energy loss sampled by all diodes of the array, the 
effect of gain variations should exactly cancel. If the array contains N elements and
gain-corrected data are required in M channels, N + M spectra must be accumulated.
Because M may be greater than N, the recorded spectral range can be greater than 
that covered by the array. However, some electrons fall outside the array and are 
not recorded, so the system DQE is reduced by a factor of N/(N + M) compared 
to parallel recording of N + M spectra which are stationary on the array. Batson 
(1988) implemented this method for a 512-channel photodiode array with M up to 
300, substantial computing being required to correct for the fact that the scan step 
was not equal to the interdiode spacing. The same procedure has been used with a 
smaller number of readouts (M < N), in which case the gain variations are reduced
but not eliminated. 
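
The cancellation can be demonstrated with a toy simulation in which a spectrum of M channels is stepped across an array of N diodes in exact one-channel increments. The function name and the example gains are hypothetical, and the idealized stepping used here needs N + M − 1 readouts for every spectral channel to visit every diode.

```python
def gain_average(true_spectrum, gains):
    """Scan a spectrum across a diode array, re-register each readout, and sum."""
    m, n = len(true_spectrum), len(gains)
    summed = [0.0] * m
    for step in range(n + m - 1):
        # Record one readout: spectral channel j falls on diode k at this step.
        readout = [0.0] * n
        for j, intensity in enumerate(true_spectrum):
            k = j - (m - 1) + step
            if 0 <= k < n:
                readout[k] = gains[k] * intensity
        # Shift the readout back into register and accumulate it.
        for k, value in enumerate(readout):
            j = k + (m - 1) - step
            if 0 <= j < m:
                summed[j] += value
    return summed

gains = [1.0, 1.1, 0.9, 1.05]                       # N = 4 diodes
true_spectrum = [10.0, 50.0, 30.0, 20.0, 40.0, 25.0]   # M = 6 channels
averaged = gain_average(true_spectrum, gains)
ratios = [a / t for a, t in zip(averaged, true_spectrum)]
```

Every entry of `ratios` equals the sum of the diode gains, so the individual gain values no longer imprint a pattern on the spectrum. The electrons that fall outside the array during the scan are the price paid in DQE.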


2.5.5.3 Binned Gain Averaging 


Bosman and Keast (2008) reported a variant of the above procedure, in which spec- 
tra are recorded from a two-dimensional CCD detector while a linear ramp voltage 
is applied to the spectrometer drift tube. For fast readout, the spectra are binned, 
i.e., all CCD elements corresponding to a given spectral channel are combined in 
an on-chip register during readout, with no dark current or gain corrections applied. 
Using the same readout settings, another series of N spectra is acquired without 
illuminating the detector, all of which are summed and divided by N to give an 
average spectrum that is subtracted from each individual spectrum of the first data 
set. This procedure ensures a dark reference with high SNR (including any differ- 
ence in efficiency between the quadrants, in a four-quadrant detector). Finally, the 
dark-corrected spectra from the first data set are aligned (by Gaussian fitting to a 
prominent spectral feature or by using the drift-tube voltage as the required shift) 
and summed, or used for EELS mapping. The dynamic range of each readout is 
limited by the capacity of the register pixels but increases when the readouts are 
combined. The result can be a dramatic reduction in the noise content of a spectrum; 
see Fig. 2.29c. 
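The sequence of steps (average the dark readouts, subtract, align, combine) can be sketched as follows. This is a simplified illustration and not the published implementation: the shifts are assumed to be known whole numbers of channels (as if taken from the drift-tube voltage), the data are noise-free, and all names and numerical values are hypothetical.

```python
def mean_frame(frames):
    """Channel-by-channel average of a list of equal-length readouts."""
    return [sum(column) / len(frames) for column in zip(*frames)]


def binned_gain_average(frames, dark_frames, shifts):
    """Subtract the averaged dark reference, undo each shift, and average."""
    dark = mean_frame(dark_frames)
    length = len(dark)
    summed, counts = [0.0] * length, [0] * length
    for frame, shift in zip(frames, shifts):
        for k in range(length):
            channel = k - shift            # undo the drift-tube displacement
            if 0 <= channel < length:
                summed[channel] += frame[k] - dark[k]
                counts[channel] += 1
    return [s / c if c else 0.0 for s, c in zip(summed, counts)]


def worst_error(spectrum, truth):
    """Largest fractional deviation from the true spectrum."""
    return max(abs(x / t - 1.0) for x, t in zip(spectrum, truth))


truth = [10.0, 12.0, 20.0, 35.0, 20.0, 12.0, 10.0, 9.0]     # hypothetical spectrum
gains = [1.10, 0.90, 1.05, 0.95, 1.10, 0.90, 1.05, 0.95]    # hypothetical channel gains
offset = [3.0, 3.5, 2.5, 3.0, 3.2, 2.8, 3.0, 3.1]           # hypothetical dark signal
shifts = [0, 1, 2, 3]                                        # ramp steps, in channels


def readout(shift):
    """Uncorrected binned readout with the spectrum displaced by `shift`."""
    return [offset[k] + (gains[k] * truth[k - shift] if 0 <= k - shift < len(truth) else 0.0)
            for k in range(len(truth))]


frames = [readout(s) for s in shifts]
dark_frames = [offset[:] for _ in range(4)]
averaged = binned_gain_average(frames, dark_frames, shifts)
single = [f - d for f, d in zip(frames[0], offset)]   # one dark-corrected readout
```

With these made-up gains, `worst_error(averaged, truth)` is smaller than `worst_error(single, truth)`, because each spectral channel has been recorded through several detector channels with different gains.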


2.5.5.4 Iterative Gain Averaging 


Another possibility is to record M spectra, each of the form J_m(E)G(E), with
successive energy shifts Δ between each spectrum, then electronically shift them back
into register and add them to give a single spectrum:


S_1(E) = \frac{1}{M} \sum_{m=0}^{m=M} J_m(E)\, G(E - m\Delta)    (2.46)


where the gain variations are spread out and reduced in amplitude by a factor ≈ M^{1/2}.
Each original spectrum is then divided by S_1(E) and the result averaged over all M
spectra to give a first estimate G_1(E) of the gain profile of the detector so that each
original spectrum can be corrected for gain variations:


S_m^1(E) = \frac{S_m(E)}{G_1(E)} = \frac{J_m(E)\, G(E)}{G_1(E)}    (2.48)


The process is then repeated, with the M spectra S_m(E) replaced by S_m^1(E), to obtain
revised data S_2(E), G_2(E), and S_m^2(E), and this procedure repeated until the effect
of gain variations becomes negligible. Schattschneider and Jonas (1993) analyzed 
this method in detail, showing that the variance due to gain fluctuations is inversely 
proportional to the square root of the number of iterations. 
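A minimal numerical sketch of the iteration is given below. It is not the analysis of Schattschneider and Jonas: for simplicity the shifts are treated as circular (channels leaving one end of the array re-enter at the other), the data are noise-free, and the function names, array size, shift, and gain amplitude are all hypothetical.

```python
import math
import random


def register_and_average(readouts, delta):
    """Shift readout m back by m*delta channels (circularly) and average."""
    n, count = len(readouts[0]), len(readouts)
    return [sum(readouts[m][(c + m * delta) % n] for m in range(count)) / count
            for c in range(n)]


def iterative_gain_average(readouts, delta, iterations):
    """Alternate between estimating the spectrum and the gain profile."""
    n, count = len(readouts[0]), len(readouts)
    current = [row[:] for row in readouts]
    for _ in range(iterations):
        averaged = register_and_average(current, delta)
        # Gain estimate: each readout divided by the averaged spectrum
        # (moved to that readout's position), then averaged over readouts.
        gain = [sum(current[m][k] / averaged[(k - m * delta) % n]
                    for m in range(count)) / count for k in range(n)]
        current = [[row[k] / gain[k] for k in range(n)] for row in current]
    return register_and_average(current, delta)


def ripple(estimate, truth):
    """RMS fractional deviation of estimate/truth from its mean value."""
    ratio = [e / t for e, t in zip(estimate, truth)]
    mean = sum(ratio) / len(ratio)
    return math.sqrt(sum((r / mean - 1.0) ** 2 for r in ratio) / len(ratio))


random.seed(1)
n, count, delta = 64, 8, 1
truth = [1000.0 + 400.0 * math.sin(2.0 * math.pi * c / n) for c in range(n)]
gain_true = [1.0 + random.uniform(-0.05, 0.05) for _ in range(n)]
readouts = [[truth[(k - m * delta) % n] * gain_true[k] for k in range(n)]
            for m in range(count)]
ripple_once = ripple(register_and_average(readouts, delta), truth)
ripple_iterated = ripple(iterative_gain_average(readouts, delta, 5), truth)
```

In this model each pass smooths the residual gain profile further, so `ripple_iterated` is smaller than `ripple_once`. Slowly varying gain components are removed least efficiently when the total shift range is small compared with their period.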


2.5.5.5 Energy-Difference Spectra 


By recording two or three spectra, J_1(E − ε), J_2(E), and J_3(E + ε), displaced in
energy by applying a small voltage ε to the spectrometer drift tube, first-difference
FD(E) or second-difference SD(E) spectra can be computed as


FD(E) = J_1(E - \varepsilon) - J_2(E)    (2.49)
SD(E) = J_1(E - \varepsilon) - 2J_2(E) + J_3(E + \varepsilon)    (2.50)


Writing the original spectrum as J(E) = G(E)[A + BE + C(E)], where G(E)
represents gain modulation by the detector, gives (for small ε)


FD(E) = G(E)[-B\varepsilon + C(E - \varepsilon) - C(E)]
      \approx G(E)[-B\varepsilon - \varepsilon\,(dC/dE)]    (2.51)


SD(E) = G(E)[C(E - \varepsilon) - 2C(E) + C(E + \varepsilon)]
      \approx G(E)[\varepsilon^{2}\,(d^{2}C/dE^{2})]    (2.52)

Because component A is absent from FD(E), gain modulation of any constant background
is removed when forming a first-difference spectrum (Shuman and Kruit,
1985). Likewise, gain modulation of any linearly varying “background” component
is removed when forming SD(E). Consequently, the signal/background ratio of genuine
spectral features is enhanced. The resulting spectra, which resemble first and
second derivatives of the original data, are highly sensitive to spectral fine structure,
making quantitative treatment of the data more difficult. FD(E) can be integrated
digitally, the integration constant A being estimated by matching to the original
spectra. The statistical noise content (excluding gain variations) in the jth channel
of the integrated spectrum J(j) is increased relative to a directly acquired spectrum
by a factor of [2(j − 1)]^{1/2} for ε = 1 channel.
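The removal of the constant and linear background terms can be verified numerically. The sketch below assumes the spectrum model J(E) = G(E)[A + BE + C(E)] with energy measured in channels and a shift of one channel; the Gaussian feature standing in for C(E), the gain ripple, and the function names are hypothetical.

```python
import math


def reading(channel, shift, a, b, gains):
    """Detector reading G(E)[A + B E' + C(E')] with E' = E - shift (in channels)."""
    e = channel - shift
    feature = 40.0 * math.exp(-((e - 20.0) / 3.0) ** 2)   # hypothetical C(E)
    return gains[channel] * (a + b * e + feature)


def first_difference(a, b, gains, eps=1):
    """FD(E) = J_1(E - eps) - J_2(E)."""
    return [reading(k, eps, a, b, gains) - reading(k, 0, a, b, gains)
            for k in range(len(gains))]


def second_difference(a, b, gains, eps=1):
    """SD(E) = J_1(E - eps) - 2 J_2(E) + J_3(E + eps)."""
    return [reading(k, eps, a, b, gains) - 2.0 * reading(k, 0, a, b, gains)
            + reading(k, -eps, a, b, gains) for k in range(len(gains))]


gains = [1.0 + 0.05 * math.sin(1.7 * k) for k in range(40)]   # hypothetical gain ripple
fd_low = first_difference(100.0, 2.0, gains)
fd_high = first_difference(1000.0, 2.0, gains)     # only A changed
sd_low = second_difference(100.0, 2.0, gains)
sd_high = second_difference(1000.0, 5.0, gains)    # both A and B changed
fd_change = max(abs(x - y) for x, y in zip(fd_low, fd_high))
sd_change = max(abs(x - y) for x, y in zip(sd_low, sd_high))
```

Both `fd_change` and `sd_change` are zero to within rounding error: the first difference does not depend on A, and the second difference depends on neither A nor B, so the gain modulation of those background components has disappeared.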


2.5.5.6 Dynamic Calibration Method 


Shuman and Kruit (1985) proposed a gain-correction procedure in which the gain
G(j) of the jth spectral channel is calculated from two difference spectra, J_1(i) and
J_2(i), shifted by one channel:


\frac{G(j)}{G(1)} = \prod_{i=2}^{i=j} J_1(i) \Big/ \prod_{i=1}^{i=j-1} J_2(i)    (2.53)


where Π represents a product of the contents J(i) of all channels between the stated
limits of i. By multiplying one of the original spectra by G(j), the effect of gain
variations is removed, although the random-noise content of J(j) is increased by a
factor of [2(j − 1)]^{1/2}.
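The product formula can be illustrated with a small sketch. It assumes that the two readouts have the form J_1(i) = G(i)S(i) and J_2(i) = G(i)S(i + 1), where S is the underlying spectrum; this model, the zero-based indexing, and the names and values below are assumptions made for illustration. Under that assumption each ratio J_1(i)/J_2(i − 1) equals G(i)/G(i − 1), so the running product telescopes to the gain relative to the first channel.

```python
def gain_ratios(j1, j2):
    """G(j)/G(first channel) from two readouts displaced by one channel."""
    ratios = [1.0]
    for i in range(1, len(j1)):
        # J1(i)/J2(i-1) equals G(i)/G(i-1), so the running product telescopes.
        ratios.append(ratios[-1] * j1[i] / j2[i - 1])
    return ratios


gains = [1.00, 0.92, 1.07, 0.98, 1.04]                 # hypothetical channel gains
signal = [200.0, 180.0, 150.0, 130.0, 120.0, 115.0]    # hypothetical spectrum
j1 = [gains[i] * signal[i] for i in range(len(gains))]
j2 = [gains[i] * signal[i + 1] for i in range(len(gains))]   # shifted by one channel
recovered = gain_ratios(j1, j2)
```

Because each entry of `recovered` is built from a chain of measured ratios, statistical errors accumulate along the array, which is consistent with the noise penalty growing with channel number j.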


2.6 Energy-Selected Imaging (ESI) 


As discussed in Section 2.1, the information carried by inelastic scattering can be 
acquired and displayed in several ways, one of these being the energy-loss spec- 
trum. In a transmission electron microscope, the volume of material giving rise to 
the spectrum can be made very small by concentrating the incident electrons into 
a small-diameter probe. EELS can be used to quantitatively analyze this small vol- 
ume, the TEM image or diffraction pattern being used to define it relative to its 
surroundings (Fig. 2.30a). This process can then be repeated for other regions of 
the specimen, either manually or by raster scanning a small probe across the speci- 
men and collecting a spectrum from every pixel, as in STEM spectrum imaging; see 
Fig. 2.30c. However, it is sometimes useful to display some spectral feature, such as 
the ionization edge representing a single chemical element, simultaneously, using 
the imaging or diffraction capabilities of a conventional TEM. An image-forming 
spectrometer is used as a filter that accepts energy losses within a specified range, 
giving an energy-selected image or diffraction pattern. We now discuss several 
instrumental arrangements that achieve this capability. 


2.6.1 Post-column Energy Filter 


As discussed in Section 2.1.1, the magnetic prism behaves somewhat like an electron 
lens, producing a chromatically dispersed image of the spectrometer object O at the 
plane of an energy-selecting slit or diode array detector. A conventional TEM not 
only provides a suitable small-diameter object at its projector lens crossover but also 
produces a magnified image or diffraction pattern of the specimen at the level of 
the viewing screen, closer to the spectrometer entrance. The spectrometer therefore
forms a second image, further from its exit, which corresponds to a magnified view
of the specimen or its diffraction pattern. This second image will be energy filtered
if an energy-selecting slit is inserted at the plane of the first image (the energy-loss
spectrum). Slit design is discussed in Section 2.4.4.1.

Fig. 2.30 Three common types of instrument used for EELS and energy-filtered imaging: (a)
imaging filter below the TEM column, (b) in-column filter, and (c) STEM system. In an instrument
dedicated to STEM, the arrangement is often inverted, with the electron source at the bottom

In general, the energy-filtered image suffers from several defects. Its magnifi- 
cation is different in the x- and y-directions, although such rectangular distortion 
can easily be corrected electronically if the image is recorded into a computer. It 
exhibits axial astigmatism (the spectrometer is double focusing only at the spec- 
trum plane), which is correctable by using the dipole-coil stigmators (Shuman and 
Somlyo, 1982). More problematic, it may be a chromatic image, blurred in the 
x-direction by an amount dependent on the width of the energy-selecting slit. Even 
so, by using a 20-μm slit (giving an energy resolution of 5 eV) and a single-prism
spectrometer of conventional design, Shuman and Somlyo (1982) obtained an image
with a spatial resolution of 1.5 nm over a 2-μm field of view at the specimen plane.

The Gatan imaging filter (GIF) uses post-spectrometer multipoles to correct for 
these image defects and form a good-quality image or diffraction pattern of the spec- 
imen, which is recorded by a CCD camera. Alternatively, the energy-selecting slit 
can be withdrawn and the GIF lens system used to project an energy-loss spectrum, 
allowing the same camera to be used for spectroscopy, as in Fig. 2.30a. The electron 
optics is discussed briefly in Section 2.5.2. Since a modern TEM usually has STEM
capabilities, the GIF can also be used to acquire spectrum images. 


2.6.2 In-Column Filters 


The prism-mirror and omega filters (Section 2.1.2) were developed specifically
for producing EFTEM images and diffraction patterns. Located in the middle of a 
TEM column, they are designed as part of a complete system rather than as add-on 
attachments to a microscope. Following the construction of such systems in several 
laboratories (Castaing et al., 1967; Henkelman and Ottensmeyer, 1974a,b; Egerton 
et al., 1974; Zanchi et al., 1977a; Krahl et al., 1978), energy-selecting microscopes 
are now produced commercially (Egle et al., 1984; Bihr et al., 1991; Tsuno et al., 
1997; Koch et al., 2006). Figure 2.31 shows the electron optics of a Zeiss omega- 
filter system; the operating modes of this kind of instrument have been described by 
Reimer (1991). 

Because of the symmetry of these multiple-deflection systems, the image or 
diffraction pattern produced is achromatic and has no distortion or axial astig- 
matism. Midplane symmetry also precludes second-order aperture aberrations and 
image-plane distortion (Rose, 1989). Reduction of axial aberration in the energy- 
selecting plane and of field astigmatism and tilt of the final image requires 
sextupoles or curved pole faces (Rose and Pejas, 1979; Jiang and Ottensmeyer, 
1993), or optimization of the optical parameters (Lanio, 1986; Krahl et al., 1990). 
Without such measures, the selected energy loss changes over the field of view, 
so that with a narrow energy-selecting slit and no specimen in the microscope, the 
zero-loss electrons would occupy a limited area on the final screen, equivalent to 
the alignment pattern of a single magnetic prism (Section 2.2.5). Similarly, with an
aluminum specimen, a low-magnification energy-selected image exhibits concentric
rings corresponding to multiples of the plasmon energy (Zanchi et al., 1977a).

The Zeiss SESAM instrument represents a recent version of this concept (Koch 
et al., 2006). It uses a Schottky source, an electrostatic Omega-type filter as a 
monochromator and a magnetic MANDOLINE filter as the spectrometer and energy 
filter; see Fig. 2.32. To minimize vibrations, the TEM column is suspended at the 
level of the objective lens. A comparable project, the JEOL MIRAI instrument, uses 
a dual Wien filter as the monochromator and has achieved an energy resolution of 
0.14 eV at 200 kV accelerating voltage (Mukai et al., 2004). 


2.6.3 Energy Filtering in STEM Mode 


Energy filtering is relatively straightforward in the case of a scanning transmission 
electron microscope, where the incident electrons are focused into a very small 
probe that is scanned over the specimen in a raster pattern. A filtered image can 
be viewed simply by directing the output of a single-channel electron detector (as 
used in serial EELS) to an image display monitor. It is preferable to digitally scan
the probe and to digitize the detector signal so that it can be stored in computer
memory, allowing subsequent image processing.

Fig. 2.31 Zeiss EM912 energy-filtering microscope: solid lines show field-defining rays, dashed
lines represent image-defining trajectories. The Omega filter produces a unit-magnification
achromatic image of the specimen (or its diffraction pattern) created by the first group of
post-specimen lenses and generates (at the plane of the energy-selecting slit) a unit-magnification
energy-dispersed image of its entrance crossover. From Carl Zeiss Topics, Issue 4, p. 4

In STEM mode, the spatial resolution of a filtered image depends on the 
incident probe diameter (spot size) and beam broadening in the specimen, but 
is independent of the spectrometer. For an instrument fitted with a Schottky
or field-emission source, the probe diameter can be below 1 nm, or less
than 0.1 nm for an aberration-corrected probe-forming lens (Krivanek et al.,
2008).

Fig. 2.32 Components of the Zeiss SESAM instrument: (a) electrostatic omega filter (dispersion
≈ 12 μm/eV at the midplane slit) and (b) MANDOLINE filter, whose dispersion at the
energy-selecting plane exceeds 6 μm/eV

If electron lenses are present between the specimen and spectrometer, the mode 
of operation of these lenses does affect the energy resolution (Section 2.3.4). Also, 
the scanning action of the probe can result in a corresponding motion of the spec- 
trum, due to movement of the spectrometer object point (image coupling) or through 
spectrometer aberrations (for diffraction coupling). Different regions of the image 
then correspond to different energy loss, limiting the field of view for a given energy 
window. The field of view d can be increased by widening the energy-selecting slit, 
but this limits the energy resolution to M M_x d/D (see Section 2.3.5). A preferable
solution is to descan the electron beam by applying the raster signal to dipole scan 
coils located after the specimen (Fig. 2.33). Ideally, the field of view is then unlim- 
ited and the energy resolution is the same as for a stationary probe. As there is 
no equivalent of descanning in a fixed-beam TEM, the use of STEM mode makes 
it easier to obtain good energy resolution in lower magnification energy-filtered 
images. 

The STEM spectrometer can also be used to obtain an energy-selected diffraction 
pattern, by using x- and y-deflection coils to scan the latter across the spectrome- 
ter entrance aperture. However, this method is inefficient, since a large proportion 
of the electrons are rejected by the angle-defining aperture. If the required angular
resolution is Δβ and the scan range is ±β, the collection efficiency (and system
DQE) is (Δβ/β)²/4. Shorter recording times and lower radiation dose are possible
by using an imaging filter to process the whole diffraction pattern simultaneously 
(Krivanek et al., 1994), making the recording of core-loss diffraction patterns more 
feasible (Botton, 2005). 
Fig. 2.33 Scheme for STEM spectrum imaging (Gubbens et al., 2010). A focused probe is scanned
across the specimen and (for lower magnification work) descanned using post-specimen coils. At
each pixel, an extended range of the energy-loss spectrum, chosen by the voltage-scan
module (VSM), is acquired by a parallel-recording spectrometer. An electrostatic shutter within 
the spectrometer allows fast switching between the low-loss and core-loss regions of the spectrum 


2.6.4 Spectrum Imaging 


With a parallel-recording system attached to a scanning-transmission electron 
microscope (STEM), the energy-loss spectrum can be read out at each picture point, 
creating a four-dimensional data array that corresponds to electron intensity within 
a three-dimensional (x, y, E) data cube (Fig. 2.1). The number of intensity values 
involved is large: 500 M for a 512 × 512 image, in the case of a 2048-channel spectrum.
If each spectral intensity is recorded to a depth of 16 bits (64K gray levels), the
total information content is then about 1 Gbyte if the data are stored as integers (without
data compression). However, advances in electronics have made the acquisition
and storage of such data routine (Gubbens et al., 2010). To be efficient, the process 
requires good synchronization between the control computer, the CCD camera, and 
the probe-scanning and voltage-offset modules; see Fig. 2.33. 

An equivalent data set can be obtained from an imaging filter, by reading out 
a series of energy-selected images at different energy loss, sometimes called an 
image spectrum (Lavergne et al., 1992). Since electrons intercepted by the energy- 
selecting slit do not contribute, this process is less efficient than spectrum imaging 
and involves a higher electron dose to the specimen, for the same information con- 
tent. However, it may involve a shorter recording time, if the incident beam current 
divided by the number of energy-selected images exceeds the probe current used 
in the spectrum imaging; see Section 2.6.5. Common problems are undersampling 
(dependent on the energy shift between readouts) and a loss of energy resolution 
(due to the width of the energy-selecting slit) but these can be addressed by FFT
interpolation and deconvolution methods (Lo et al., 2001). 

The main attraction of the spectrum image concept is that more spectroscopic 
data are recorded and can be subsequently processed to extract information that 
might otherwise have been lost. Examples of such processing include the calculation 
of local thickness, pre-edge background subtraction, deconvolution, multivariate 
analysis, and Kramers-Kronig analysis. The resulting information can be displayed
as line scans (Tencé et al., 1995) or two-dimensional images of specimen thickness, 
elemental concentration, complex permittivity, and bonding information (Hunt and 
Williams, 1991; Botton and L’Esperance, 1994; Arenal et al., 2008). In addition, 
instrumental artifacts such as gain nonuniformities and drift of the microscope high 
voltage or beam current can be corrected by post-acquisition processing. 

The acquisition time of a spectrum image is often quite long. In the past, the min- 
imum pixel time has been limited by the array readout time, but that has recently 
been reduced from 25 to 1 ms (Gubbens et al., 2010). A line spectrum, achieved by 
scanning an electron probe in a line and recording a spectrum from each pixel, can 
be acquired more rapidly and is often sufficient for determining elemental profiles. 
Similar data can be obtained in fixed-beam TEM mode, with broad beam illumi- 
nating a slit introduced at the entrance of a double-focusing spectrometer. The long 
direction (y) of the slit corresponds to the nondispersive direction in the spectrome- 
ter image plane, allowing the energy-loss intensity J(y,E) to be recorded by a CCD 
camera. One advantage here is that all spectra are acquired simultaneously, so spec- 
imen drift does not distort the information obtained, although it may result in loss 
of spatial resolution. 

A common form of energy-filtered image involves selecting a range of energy 
loss (typically 10 eV or more in width) corresponding to an inner-shell ioniza- 
tion edge. Since each edge is characteristic of a particular element, the core-loss 
image contains information about the spatial distribution of elements present in 
the specimen. However, each ionization edge is superimposed on a spectral back- 
ground arising from other energy-loss processes. To obtain an image that represents 
the characteristic loss intensity alone, the background contribution I_b within the
core-loss region of the spectrum must be subtracted, as in the case of spectroscopy
(Section 4.4). The background intensity often decreases smoothly with energy loss
E, approximating to a power law form J(E) = AE^{-r}, where A and r are parameters
that can be determined by examining J(E) at energy losses just below the ionization
threshold (Section 4.4.2). Unfortunately, both A and r can vary across the specimen,
as a result of changes in thickness and composition (Leapman and Swyt, 1983;
Leapman et al., 1984c), in which case a separate estimation of I_b is required at each
picture element (pixel). 

In the case of STEM imaging, where each pixel is measured sequentially, local 
values of A and r can be obtained through a least-squares or two-area fitting to the 
pre-edge intensity recorded over several channels preceding the edge. With electro- 
static deflection of the spectrometer exit beam and fast electronics, the necessary 
data processing may be done within each pixel dwell period (“on the fly”) and the 
system can provide a live display of the appropriate part of the spectrum (Gorlen 
et al., 1984), values of A and r being obtained by measuring two or more energy-loss
channels preceding the edge (Jeanguillaume et al., 1978). By recording a complete 
line of the picture before changing the spectrometer excitation, the need for fast 
beam deflection is avoided, background subtraction and fitting being done off-line 
after storing the images (Tencé et al., 1984). 
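A least-squares estimate of A and r from the pre-edge channels can be sketched as follows. The sketch assumes an unweighted straight-line fit of ln J against ln E, which is only one of several possible fitting schemes; the function name, the energy window, and the background parameters are hypothetical.

```python
import math


def fit_power_law(energies, counts):
    """Least-squares fit of ln J = ln A - r ln E over the pre-edge channels."""
    xs = [math.log(e) for e in energies]
    ys = [math.log(c) for c in counts]
    n = len(xs)
    x_mean, y_mean = sum(xs) / n, sum(ys) / n
    slope = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
             / sum((x - x_mean) ** 2 for x in xs))
    r = -slope                               # the slope of the log-log line is -r
    a = math.exp(y_mean + r * x_mean)        # intercept gives ln A
    return a, r


pre_edge = [float(e) for e in range(250, 281)]     # eV, hypothetical pre-edge window
true_a, true_r = 1.0e9, 3.2                        # hypothetical background parameters
counts = [true_a * e ** (-true_r) for e in pre_edge]
a_fit, r_fit = fit_power_law(pre_edge, counts)
background_290 = a_fit * 290.0 ** (-r_fit)         # extrapolated beyond the window
```

For noise-free power-law data the fit returns the input parameters. With real data the extrapolated background carries a statistical error that grows with the distance of extrapolation, so the fitted window is normally placed as close to the edge as the fine structure allows.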

In the case of energy filtering in a fixed-beam TEM, the simplest method of 
background subtraction is to record one image at an energy loss just below the ion- 
ization edge of interest and subtract some constant fraction of its intensity from 
a second image recorded just above the ionization threshold. First done by photo- 
graphic recording (Ottensmeyer and Andrew, 1980; Bazett-Jones and Ottensmeyer, 
1981), the subtraction process is more accurate and convenient with CCD images. 
Nevertheless, this simple procedure assumes that the exponent r that describes the 
energy dependence of the background is constant across the image or that the back- 
ground is a constant fraction of the core-loss intensity. In practice, r varies as the 
local composition or thickness of the specimen changes (Fig. 3.35), making this 
two-window method of background subtraction potentially inaccurate (Leapman 
et al., 1984c). Variation of r is taken into account in the three-window method by 
electronically recording two background-loss images at slightly different energy 
loss and determining A and r at each pixel. However, the reduction in systematic 
error of background fitting comes at the expense of an increased statistical error 
(Section 4.4.3), so a longer recording time is needed for an acceptable signal/noise 
ratio (Leapman and Swyt, 1983; Pun and Ellis, 1983). 
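The per-pixel arithmetic of the three-window method can be sketched as below. For simplicity each energy window is represented by a single energy (a narrow-window assumption), and the function name, window positions, and pixel values are hypothetical.

```python
import math


def three_window_signal(i1, i2, i3, e1, e2, e3):
    """Core-loss signal at e3 after power-law extrapolation from e1 and e2."""
    r = math.log(i1 / i2) / math.log(e2 / e1)   # local exponent
    a = i1 * e1 ** r                            # local amplitude
    return i3 - a * e3 ** (-r), a, r


e1, e2, e3 = 260.0, 275.0, 295.0                     # eV, hypothetical window centres
pixels = [(2.0e9, 3.0, 40.0), (5.0e9, 3.4, 15.0)]    # hypothetical (A, r, core signal)
results = []
for a_true, r_true, signal in pixels:
    i1 = a_true * e1 ** (-r_true)                # first pre-edge image
    i2 = a_true * e2 ** (-r_true)                # second pre-edge image
    i3 = a_true * e3 ** (-r_true) + signal       # post-edge image
    results.append(three_window_signal(i1, i2, i3, e1, e2, e3))
```

Each entry of `results` recovers the core-loss signal of its pixel even though A and r differ between the two pixels, which is the situation in which subtracting a fixed fraction of a single pre-edge image would fail.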


2.6.4.1 Influence of Diffraction Contrast 


Even if the background intensity is correctly subtracted, a core-loss image may be 
modulated by diffraction (aperture) contrast, arising from variations in the amount 
of elastic scattering intercepted by the angle-limiting aperture. A simple test is 
to examine an unfiltered bright-field image, recorded at Gaussian focus using the 
same collection aperture; the intensity modulation in this image is a measure of the 
amount of diffraction contrast. Several methods have been proposed for removing 
this aperture contrast, in order to obtain a true elemental map: 


(a) Dividing the core-loss intensity I_k by a pre-edge background level, to form a
jump-ratio image (Johnson, 1979).

(b) Dividing by the intensity of a low-loss (e.g., first-plasmon) peak, which is also
modulated by diffraction contrast.

(c) Dividing I_k by the intensity I_l, measured over an equal energy window in the
low-loss region of the spectrum (Egerton, 1978a; Butler et al., 1982). According
to Eq. (4.65), the ratio I_k/I_l is proportional to areal density (number of atoms of
an analyzed element per unit area of the specimen).

(d) Taking a ratio of the core-loss intensities of two elements, giving an image that
represents their elemental ratio; see Eq. (4.66).

(e) Using conical rocking-beam illumination, which varies the angle of incidence
over a wide range (Hofer and Warbichler, 1996).

(f) Recording the filtered images with a large collection semi-angle, so that almost
all the inelastic and elastic scattering enters the spectrometer (Egerton, 1981d;
Muller et al., 2008).


Methods (a), (b), and (d) have the advantage that they correct also for variations
in specimen thickness (if the thickness is not too large), giving an image intensity
that reflects the concentration of the analyzed element (atoms/volume) rather than
its areal density.
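The principle behind the ratio methods can be shown with a very small sketch. It assumes that diffraction contrast multiplies the pre-edge and post-edge intensities of a pixel by the same factor, which is the premise of method (a); the intensity values and names are hypothetical.

```python
def jump_ratio(post_edge, pre_edge):
    """Pixel-by-pixel ratio of the post-edge image to the pre-edge image."""
    return [post / pre for post, pre in zip(post_edge, pre_edge)]


diffraction = [1.0, 0.6, 1.4]          # hypothetical aperture-contrast factor per pixel
pre_edge = [d * 50.0 for d in diffraction]             # background below the edge
post_edge = [d * (40.0 + 20.0) for d in diffraction]   # background plus core loss above it
ratios = jump_ratio(post_edge, pre_edge)
```

All three pixels give the same ratio even though their raw intensities differ by more than a factor of two. The ratio is not proportional to elemental concentration, however, so a jump-ratio image is a qualitative rather than a quantitative map.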


2.6.5 Comparison of Energy-Filtered TEM and STEM 


To compare the advantages of the fixed-beam TEM and STEM procedures of energy 
filtering, we will assume equal collection efficiency (same β*; see Section 4.5.3),
similar spectrometer performance in terms of energy resolution over the field of
view, and electron detectors with similar noise properties. We will take the spatial
resolution to be the same in both methods, a resolution below 1 nm being achievable
(for some specimens) using either procedure. 

Consider first elemental mapping with a TEM imaging filter, in comparison with 
a STEM system operating with an energy-selecting slit (as in serial recording). At 
any instant, a single energy loss is recorded, energy-selecting slits being present 
in both cases; see Fig. 2.34. For two-window background fitting, three complete 
images are acquired for each element in either mode, as discussed in the previ- 
ous section. For the same amount of information, the electron dose is therefore the 
same in both methods. The only difference is that the electron dose is delivered
continuously in the fixed-beam TEM and for a small fraction of the frame time in
STEM mode.

In the STEM case, the dose rate (current density in the probe) can be considerably
higher than for broad-beam TEM illumination. Whether an equal dose implies
the same amount of radiation damage to the specimen then depends on whether the 
radiation damage is dose rate dependent. For polymer specimens, temperature rise 
in the beam can increase the amount of structural damage for a given dose (Payne 
and Beamson, 1993). In some inorganic oxides, mass loss (hole drilling) occurs 
only at the high current densities such as are possible in a field-emission STEM (see 
Section 5.7.5). For such specimens, the STEM procedure would be more damaging, 
for the same recorded information. 

The recording time for a single-element map is generally longer in STEM, since 
the probe current (even with a field-emission source) is typically below 1 nA, 
whereas the beam current in a conventional TEM can be as high as 1 μA. STEM
recording of a 512 × 512-pixel image may take as much as 1 h to achieve adequate
statistics, which is inconvenient and requires specimen drift correction. The usual 
solution is to reduce the amount of information recorded by decreasing the number 
of pixels. In a conventional TEM, the CCD camera offers typically 2k x 2k pixels, 
so EFTEM can provide a higher information rate. 

When a parallel-recording spectrometer is used in the STEM, the three images 
required for each element are recorded simultaneously, reducing both the time and 
the dose by a factor of 3. This factor becomes 3n in a case where n ionization edges 
are recorded simultaneously. STEM with a parallel-recording spectrometer should 
therefore produce less radiation damage than an energy-filtering TEM, unless dose 
rate effects occur that outweigh this advantage. 

The advantage of STEM is increased further when an extended range of energy 
loss is recorded by the diode array, as in spectrum imaging. If use is made of the 
information recorded by all N detector channels, a given electron dose to the spec- 
imen yields N times as much information as in EFTEM, where N energy-selected 
images would need to be acquired sequentially to form an image spectrum of equal 
information content. STEM spectrum imaging also makes it easier to perform on- 
or off-line correction for specimen drift, so that long recording times (although 
inconvenient) do not necessarily compromise the spatial resolution of analysis. The 
acquisition time in STEM could actually be shorter if the product N I_p, where I_p is
the probe current, exceeds the beam current used for EFTEM imaging. 

Both scanning and fixed-beam modes of operation are possible in a conventional 
TEM equipped with probe scanning and (preferably) a field-emission source and 
parallel-recording spectrometer. With such an instrument, single-element imaging 
is likely to be faster in EFTEM mode, but in the case of multielement imaging or 
the need for data covering a large range of energy loss, the STEM mode is more 
efficient in terms of specimen dose and (possibly) acquisition time. 

The above arguments assume that the whole of the imaged area is uniformly irra- 
diated in STEM mode, just as it is in EFTEM. However, if the STEM probe size 
were kept constant and the magnification reduced, only a fraction of the pixel size 
would be irradiated by the probe; radiation damage would be higher than necessary, 
with regions between scan lines or probe positions (for a digital scan) left unirra- 
diated. In other words, the specimen would be undersampled by the probe. One 
solution is to increase the probe diameter as the magnification is lowered, although
this may still leave some of the specimen unirradiated because the probe is circular
and the pixels are square. An equally good solution, in the case of a digitally generated
scan, is to use sub-pixel scanning: for an image having N × N pixels, waveforms
whose frequency is N times larger and amplitude N times smaller are added to the 
line and frame scans. The very small probe is then scanned in a small square raster, 
covering each pixel area, so that severe undersampling of the specimen is avoided. 
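For a digital scan, the sub-pixel raster can be sketched as a list of probe positions. The sketch below is a discrete illustration rather than a description of any particular scan generator: the function name, the choice of an N × N sub-raster centred on equal sub-cells, and the field width are assumptions.

```python
def subpixel_scan(n_pixels, field_width):
    """Probe positions for an n x n image in which each pixel is covered
    by its own n x n sub-raster (step n times smaller than the pixel)."""
    pixel = field_width / n_pixels        # main-scan step
    sub = pixel / n_pixels                # sub-scan step, n times smaller
    positions = {}
    for row in range(n_pixels):
        for col in range(n_pixels):
            positions[(row, col)] = [
                (col * pixel + (i + 0.5) * sub, row * pixel + (j + 0.5) * sub)
                for j in range(n_pixels) for i in range(n_pixels)]
    return positions


scan = subpixel_scan(4, 4.0)    # hypothetical 4 x 4 image, field width 4 units
```

The signal recorded at all sub-positions of a pixel would be summed to give that pixel's value, so the image size is unchanged while the dose is spread over the whole pixel area.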


2.6.6 Z-Contrast and Z-Ratio Imaging 


The scanning-transmission electron microscope has the advantage of being able 
to efficiently collect electrons that are scattered through large angles, by use of a 
high-angle annular dark-field (HAADF) detector. This high-angle elastic scattering 
approximates to Rutherford elastic scattering (Section 3.1.2), which ideally has a 
Z² dependence on atomic number Z, although slightly lower exponents are common
(Section 3.1.6). The STEM therefore provides Z-contrast images, useful for
imaging clusters of heavy atoms on a lower-Z substrate (e.g., catalyst studies) and
atomic columns in crystalline specimens (Pennycook and Jesson, 1991; Krivanek
et al., 2010). Channeling of the incident electrons limits the lateral spreading of 
the beam in a crystalline specimen, allowing core-loss spectroscopy to identify the 
atomic number of each column (Browning et al., 1993b, 1999; Pennycook et al., 
1995a; Varela et al., 2004). This spectroscopy can be precisely simultaneous with 
the structural imaging because it makes use of electrons that pass through the central 
hole of the HAADF detector, as in Fig. 2.35. 

Z-ratio imaging refers to a technique first used by Crewe et al. (1975) to dis- 
play images of single atoms of Hg and U on a thin-carbon (<10 nm) support 
film. The ADF signal I_d was divided by the total-inelastic signal I_i recorded
through a spectrometer, using a wide energy-selecting slit that simply eliminated
the unscattered component I_u; see Fig. 2.35. A single-channel detector, such as a
scintillator/PMT combination, is sufficient to measure I_i. Because variations in mass
thickness of the support contribute almost equally to I_d and I_i, their unwanted effect
largely disappears in the image formed from the ratio R = I_d/I_i. Taking a ratio also
cancels any fluctuations in probe current but does involve some increase in noise,
particularly if the specimen is very thin.

Because of its relatively broad angular distribution (Section 3.2.1), elastic scattering
provides the main contribution to I_d, whereas the main contribution to I_i comes
from electrons that suffer only inelastic scattering. If the specimen is amorphous
and very thin, the elastic and inelastic scattering are both proportional to specimen
thickness and to the appropriate atomic cross sections σ_e and σ_i. Assuming that all
elastic scattering is recorded by the dark-field detector and all inelastic scattering is
captured by the spectrometer, the ratio signal is


R = I_d / I_i = \sigma_e / \sigma_i = \lambda_i / \lambda_e    (2.54)


where λ_e and λ_i are mean free paths for elastic and inelastic scattering. The intensity
in the ratio image is then proportional to the local value of the elastic/inelastic
scattering ratio, which in theory and in practice (Section 3.2.1) is proportional to
atomic number Z.

In the case of specimens that are thicker than about $\lambda_i/2$, the dark-field and
inelastic signals are given more accurately (Lamvik and Langmore, 1977; Egerton,
1982c) by

$$I_d = I\,[1 - \exp(-t/\lambda_e)] \qquad (2.55)$$

$$I_i = I \exp(-t/\lambda_e)\,[1 - \exp(-t/\lambda_i)] \qquad (2.56)$$

where $I$ is the incident beam current and $t$ is the local thickness of the specimen.
The exponential functions in Eqs. (2.55) and (2.56) occur because of plural scattering,
as a result of which neither $I_i$ nor $I_d$ is proportional to specimen thickness. For
$t/\lambda_i < 1$, this nonlinearity can be removed by digital processing
(Jeanguillaume et al., 1992). For $t/\lambda_i > 1$, $I_i$ reaches a maximum and then
decreases with increasing thickness, as many of the inelastically scattered electrons
also undergo elastic scattering and therefore contribute to $I_d$ rather than to $I_i$.
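
The thickness dependence described by Eqs. (2.55) and (2.56) can be explored numerically.
The following is a minimal Python sketch, not part of the original text: the function
names and the mean free paths used in the demonstration are illustrative assumptions.
Setting the derivative of Eq. (2.56) to zero suggests that the inelastic signal peaks at
a thickness of lam_i * ln(1 + lam_e/lam_i), which the second helper evaluates.

```python
import math


def z_ratio_signals(t, lam_e, lam_i, incident=1.0):
    """Return (I_d, I_i, R) for thickness t using Eqs. (2.55) and (2.56).

    t, lam_e and lam_i must share the same length unit.
    """
    i_d = incident * (1.0 - math.exp(-t / lam_e))
    i_i = incident * math.exp(-t / lam_e) * (1.0 - math.exp(-t / lam_i))
    return i_d, i_i, i_d / i_i


def thickness_of_max_inelastic(lam_e, lam_i):
    """Thickness at which I_i of Eq. (2.56) peaks (from dI_i/dt = 0)."""
    return lam_i * math.log(1.0 + lam_e / lam_i)


if __name__ == "__main__":
    lam_e, lam_i = 300.0, 100.0  # nm; hypothetical values, not taken from the text
    print("thin-specimen limit lam_i/lam_e =", lam_i / lam_e)
    for t in (1.0, 50.0, 100.0, 200.0, 400.0):
        i_d, i_i, r = z_ratio_signals(t, lam_e, lam_i)
        print(f"t = {t:5.0f} nm  I_d = {i_d:.3f}  I_i = {i_i:.3f}  R = {r:.3f}")
    print("I_i peaks near t =", thickness_of_max_inelastic(lam_e, lam_i), "nm")
```

For very small t the ratio returned by the sketch approaches lam_i/lam_e, in line with
Eq. (2.54); at larger t it drifts upward because of plural scattering.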

Z-ratio imaging has been used to increase the contrast of thin (30-nm) sections 
of stained and unstained tissues (Carlemalm and Kellenberger, 1982; Reichelt et al., 
1984). If the sample is crystalline and the ADF detector accepts medium-angle scat- 
tering, both the elastic and inelastic images are strongly influenced by diffraction 
contrast, which may increase rather than cancel when the ratio is taken (Donald and 
Craven, 1979). 


Chapter 3 
Physics of Electron Scattering 


It is convenient to divide the scattering of fast electrons into elastic and inelastic 
components that can be distinguished on an empirical basis, the term elastic mean- 
ing that any energy loss to the sample is not detectable experimentally. This criterion 
results in electron scattering by phonon excitation being classified as elastic (or 
quasielastic) when measurements are made using an electron microscope, where 
the energy resolution is rarely better than 0.1 eV. The terms nuclear (for elastic scat- 
tering) and electronic (for inelastic scattering) would be more logical but are not 
widely used. 


3.1 Elastic Scattering 


Elastic scattering is caused by interaction of incident electrons with the electrostatic 
field of atomic nuclei. The atomic electrons are involved only to the extent that 
they terminate the nuclear field and therefore determine its range and magnitude. 
Because a nucleus is some thousands of times more massive than an electron, the 
energy transfer involved in elastic scattering is usually negligible. However, for the 
small fraction of electrons that are scattered through large angles (up to 180°), the 
transfer can amount to some tens of electron volts, as evidenced by the occurrence 
of displacement damage and electron-induced sputtering at high incident energies 
(Jenkins and Kirk, 2001). 

Although not studied directly by electron energy-loss spectroscopy, elastic 
scattering is relevant for the following reasons: 


1. Electrons can undergo both elastic and inelastic interactions within the sample, 
so elastic scattering modifies the angular distribution of the inelastically scattered 
electrons. 

2. In a crystalline material, elastic scattering can redistribute the electron flux 
(current density) within each unit cell and change the probability of localized 
inelastic scattering (see Section 3.1.4). 

3. The ratio of elastic and inelastic scattering can provide an estimate of the local 
atomic number or chemical composition of a specimen (Sections 2.6.6 and 
5.4.1). 

3.1.1 General Formulas


A quantity of basic importance in scattering theory is the differential cross section
$d\sigma/d\Omega$, representing the probability of an incident electron being scattered
(per unit solid angle $\Omega$) by a given atom. For elastic scattering, one can write

$$d\sigma/d\Omega = |f|^2 \qquad (3.1)$$

where $f$ is a (complex) scattering amplitude or atomic scattering factor, which is
a function of the scattering angle $\theta$ or scattering vector $\mathbf{q}$. The phase
component of $f$ is important in high-resolution phase-contrast microscopy (Spence, 2009)
but for the calculation of scattered intensity only the amplitude is required, as implied
in Eq. (3.1). Within the first Born approximation (equivalent to assuming only single
scattering within each atom), $f$ is proportional to the three-dimensional Fourier
transform of the atomic potential $V(r)$.

Alternatively, the differential cross section can be expressed in terms of an elastic
form factor $F(q)$:

$$\frac{d\sigma}{d\Omega} = \frac{4}{a_0^2 q^4}\,|F(q)|^2 = \frac{4\gamma^2}{a_0^2 q^4}\,|Z - f_x(q)|^2 \qquad (3.2)$$

Here $a_0 = 4\pi\varepsilon_0\hbar^2/(m_0 e^2) = 0.529 \times 10^{-10}$ m is the first Bohr
radius and $\gamma = (1 - v^2/c^2)^{-1/2}$ is the relativistic factor tabulated in
Appendix E; $f_x(q)$ is the atomic scattering factor (or form factor) for an incident
x-ray photon and is equal to the Fourier transform of the electron density within the
atom. The atomic number Z in Eq. (3.2) represents the nuclear charge and denotes the fact
that incident electrons are scattered by the entire electrostatic field of an atom,
whereas x-rays interact mainly with the atomic electrons.


3.1.2 Atomic Models 


The earliest and simplest model for elastic scattering of charged particles is based
on the unscreened electrostatic field of a nucleus and was first used by Rutherford
to account for the scattering of α-particles (Geiger and Marsden, 1909). While an
α-particle is repelled by the nucleus, an incident electron is attracted, but in either
case, classical mechanics indicates that the trajectories are hyperbolic (Fig. 3.1).
Classical and wave-mechanical theories lead to the same expression for the
differential cross section, which is obtained by setting the electronic term $f_x(q)$ to
zero in Eq. (3.2), giving

$$d\sigma/d\Omega = 4\gamma^2 Z^2/(a_0^2 q^4) \qquad (3.3)$$


Here $q$ is the magnitude of the scattering vector and is given by
$q = 2k_0 \sin(\theta/2)$, as illustrated in Fig. 3.2; $\hbar k_0 = \gamma m_0 v$ is the
momentum of the incident electron; and $\hbar q$ is the momentum transferred to the
nucleus. For lighter elements, Eq. (3.3) is a reasonable approximation at large
scattering angles (see Fig. 4.24) and is useful for estimating rates of backscattering
($\theta > \pi/2$) in solids (Reimer, 1989). But since no allowance has been made for
screening of the nuclear field by the atomic electrons, the Rutherford model greatly
overestimates the elastic scattering at small $\theta$ (corresponding to larger impact
parameter $b$) and gives an infinite cross section if integrated over all angles.

Fig. 3.1 Rutherford scattering of an electron by the electrostatic field of an atomic
nucleus, viewed from a classical (particle) perspective. Each value of impact parameter
$b$ gives rise to a different scattering angle $\theta$, and as $b$ increases, $\theta$
decreases because the electron experiences a weaker electrostatic attraction. For small
$\theta$, $d\sigma/d\Omega$ is proportional to $\theta^{-4}$


Fig. 3.2 Vector diagram for elastic scattering, where $\mathbf{k}_0$ and $\mathbf{k}_1$
are the wavevectors of the fast electron before and after scattering. From geometry of
the right-angled triangles, the magnitude of the scattering vector is
$q = 2k_0 \sin(\theta/2)$. The direction of $\mathbf{q}$ has been chosen to represent
momentum transfer to the specimen (opposite to the wavevector change of the fast
electron), as normally required in equations that deal with the effects of elastic and
inelastic scattering

The simplest way of incorporating screening is through a Wentzel (or Yukawa)
formula, in which the nuclear potential is attenuated exponentially as a function of
distance $r$ from the nucleus:

$$\phi(r) = [Ze/(4\pi\varepsilon_0 r)] \exp(-r/r_0) \qquad (3.4)$$

where $r_0$ is a screening radius. Equation (3.4) leads to the angular distribution:

$$\frac{d\sigma}{d\Omega} = \frac{4\gamma^2}{a_0^2}\left(\frac{Z}{q^2 + r_0^{-2}}\right)^2 \approx \frac{4\gamma^2 Z^2}{a_0^2 k_0^4}\,\frac{1}{(\theta^2 + \theta_0^2)^2} \qquad (3.5)$$

where $\theta_0 = (k_0 r_0)^{-1}$ is a characteristic angle of elastic scattering. The
angular dependence for a mercury atom is shown in Fig. 3.3, and for a carbon atom in
Fig. 3.5. The fraction of elastic scattering that lies within the angular range
$0 < \theta < \beta$ is (for $\beta < 1$ rad)

$$\frac{\sigma_e(\beta)}{\sigma_e} = \frac{1}{1 + [2k_0 r_0 \sin(\beta/2)]^{-2}} \approx \left(1 + \frac{\theta_0^2}{\beta^2}\right)^{-1} \qquad (3.6)$$

Following Wentzel (1927) and Lenz (1954), an estimate of $r_0$ can be obtained
from the Thomas-Fermi statistical model, treating the atom as a free-electron gas,
namely

$$r_0 = a_0 Z^{-1/3} \qquad (3.7)$$


Fig. 3.3 Angular dependence of the differential cross section for elastic scattering of
30-keV electrons from a mercury atom, calculated using the Lenz model with a Wentzel
potential (solid curve) and on the basis of Hartree-Fock (dotted curve), Hartree-Slater
(chained curve), and Dirac-Slater (dashed curve) wavefunctions. Dirac-Slater results are
also shown for a single- and double-ionized atom. From Langmore et al. (1973), copyright
Springer. [Plot: $d\sigma/d\Omega$ (nm²/sr) against $q$ (nm⁻¹), logarithmic axes]

Integrating Eq. (3.5) over all scattering angles gives

$$\sigma_e = \int_0^{\pi} \frac{d\sigma}{d\Omega}\, 2\pi \sin\theta \, d\theta = \frac{4\pi\gamma^2}{k_0^2}\, Z^{4/3} = (1.87 \times 10^{-24}\ \mathrm{m^2})\, Z^{4/3} (v/c)^{-2} \qquad (3.8)$$

where $v$ is the velocity of the incident electron and $c$ is the speed of light in
vacuum. For an element of low atomic number, Eq. (3.8) gives cross sections that are
accurate to about 10%, as confirmed by measurement on gases (Geiger, 1964). For a heavy
element such as mercury, the Lenz model underestimates small-angle scattering by
an order of magnitude (see Fig. 3.3), due largely to the neglect of electron exchange;
for 100-keV electrons Eq. (3.8) gives only about 60% of the value obtained from
more sophisticated calculations (Langmore et al., 1973). Some authors use a coefficient
of 0.885 in Eq. (3.7) or take $r_0 = 0.9 a_0 Z^{-1/4}$. However, the main virtue of
the Lenz model is that it provides a rapid estimate of the angular dependence of
scattering, as in the LENZPLUS program described in Appendix B.
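
As an illustration of how quickly the Lenz-model quantities can be evaluated, the
following Python sketch computes the characteristic angle, the total cross section of
Eq. (3.8), and the small-angle form of Eq. (3.6). It is not the LENZPLUS program; the
function names are hypothetical, and the physical constants are standard values supplied
here as assumptions.

```python
import math

HBAR_C_EV_M = 1.973269804e-7  # hbar*c in eV m
M0C2_EV = 510998.95           # electron rest energy m0 c^2 in eV
A0_M = 0.529177e-10           # first Bohr radius in m


def gamma_factor(e0_ev):
    """Relativistic factor gamma for kinetic energy E0 (eV)."""
    return 1.0 + e0_ev / M0C2_EV


def wavenumber(e0_ev):
    """k0 in rad/m, from (hbar k0 c)^2 = E0 (E0 + 2 m0 c^2)."""
    return math.sqrt(e0_ev * (e0_ev + 2.0 * M0C2_EV)) / HBAR_C_EV_M


def lenz_theta0(z, e0_ev):
    """Characteristic angle theta_0 = 1/(k0 r0), with r0 = a0 Z^(-1/3), Eq. (3.7)."""
    r0 = A0_M * z ** (-1.0 / 3.0)
    return 1.0 / (wavenumber(e0_ev) * r0)


def lenz_total_cross_section(z, e0_ev):
    """Total elastic cross section in m^2: 4 pi gamma^2 Z^(4/3) / k0^2, Eq. (3.8)."""
    g = gamma_factor(e0_ev)
    return 4.0 * math.pi * g ** 2 * z ** (4.0 / 3.0) / wavenumber(e0_ev) ** 2


def lenz_fraction_within(beta, z, e0_ev):
    """Fraction of elastic scattering inside semi-angle beta (rad), small-angle Eq. (3.6)."""
    return 1.0 / (1.0 + (lenz_theta0(z, e0_ev) / beta) ** 2)


if __name__ == "__main__":
    z, e0 = 6, 100e3  # carbon, 100-keV electrons
    print("theta_0 (mrad):", 1e3 * lenz_theta0(z, e0))
    print("sigma_e (m^2):", lenz_total_cross_section(z, e0))
    print("fraction within 10 mrad:", lenz_fraction_within(0.010, z, e0))
```

Because the cross section is computed from $k_0$ and $\gamma$ rather than from the
numerical coefficient, the sketch also provides a check on the factor of
$1.87 \times 10^{-24}$ m² quoted in Eq. (3.8).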

More accurate cross sections are achieved by calculating the atomic potential
from an iterative solution of the Schrödinger equation, as in the Hartree-Fock
and Hartree-Slater methods (Ibers and Vainstein, 1962; Hanson et al., 1964).
Alternatively, electron spin and relativistic effects within the atom can be included
by using the Dirac equation (Cromer and Waber, 1965), leading to the so-called
Mott cross sections. Partial wave methods can be used to avoid the Born approximation
(Rez, 1984), which fails if Z approaches or exceeds 137 (v/c), in other words
for heavy elements or low incident energies.

Langmore et al. (1973) proposed the following equation for estimating the total
elastic cross section of an atom of atomic number Z:

$$\sigma_e = \frac{(1.5 \times 10^{-24}\ \mathrm{m^2})\, Z^{3/2}}{(v/c)^2}\left[1 - \frac{Z}{596\,(v/c)}\right] \qquad (3.9)$$

The coefficient and Z-exponent are based on Hartree-Slater calculations; the term
in brackets represents a correction to the Born approximation. The accuracy of
Eq. (3.9) is limited to about 30% because the graph of $\sigma_e$ against Z is in
reality not a smooth curve but displays irregularities that reflect the outer-shell
structure of each atom; see Fig. 3.4. A compilation of elastic cross sections
($d\sigma/d\Omega$ and $\sigma_e$) is given by Riley et al. (1975), based on relativistic
Hartree-Fock wavefunctions.

For an ionized atom, the atomic potential remains partially unscreened at large $r$,
so $d\sigma/d\Omega$ continues to increase with increasing impact parameter (decreasing
$\theta$); see Fig. 3.3. As a result, the amount of elastic scattering can appreciably
exceed that from a neutral atom, particularly in the range of low scattering angles
(Anstis et al., 1973; Fujiyoshi et al., 1982).

Fig. 3.4 Cross section $\sigma_e$ for elastic scattering of 100-keV electrons,
calculated by Humphreys et al. (1974). Curve a shows individual data points derived from
Doyle-Turner scattering factors, based on Hartree-Fock wavefunctions. Lines b and c
represent the Lenz model, with and without a multiplying factor of 1.8 applied to
Eq. (3.8). Curves d and e give cross sections $[\sigma_e - \sigma_e(\beta)]$ for elastic
scattering above an angle $\beta$ of 24 and 150 mrad, respectively. Note that the
atomic-shell periodicity in the Z-dependence disappears as $\beta$ becomes large and the
scattering approximates to Rutherford collision with small impact parameter, where
screening by outer-shell electrons is unimportant

The scattering theory just described is based on the properties of a single isolated
atom. In a molecule, the cross section per atom is reduced at low scattering angles,
typically by 10-20%, as a result of chemical bonding (Fink and Kessler, 1967). In
a crystalline solid, the angular dependence of elastic scattering is changed
dramatically by diffraction effects but in an amorphous solid, diffraction is weak and
an atomic model can be used as a guide to the magnitude and angular distribution of
elastic scattering. As an alternative to describing the amount of scattering in terms of
a cross section ($\sigma_e$ per atom), one can use an inverse measure:
$\lambda_e = (\sigma_e n_a)^{-1}$, where $n_a$ is the number of atoms per unit volume of
the specimen. In an amorphous material at least, the mean free path $\lambda_e$ can be
thought of as the mean distance between elastic collisions.
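
The conversion from a cross section per atom to an elastic mean free path is simple
enough to sketch in a few lines of Python. The helper names below are hypothetical, and
the example inputs (a cross section of about 6.8 × 10⁻²³ m², roughly what Eq. (3.8) gives
for carbon at 100 keV, and an assumed density of 2000 kg/m³ for amorphous carbon) are
illustrative rather than values taken from the text.

```python
AVOGADRO = 6.02214076e23  # atoms per mole


def atoms_per_volume(density_kg_m3, molar_mass_kg_mol):
    """Number of atoms per cubic metre, n_a, for an elemental solid."""
    return density_kg_m3 * AVOGADRO / molar_mass_kg_mol


def elastic_mean_free_path(sigma_e_m2, n_a_per_m3):
    """Mean free path lambda_e = 1 / (sigma_e * n_a), in metres."""
    return 1.0 / (sigma_e_m2 * n_a_per_m3)


if __name__ == "__main__":
    n_a = atoms_per_volume(2000.0, 0.012011)   # assumed density of amorphous carbon
    lam_e = elastic_mean_free_path(6.8e-23, n_a)
    print(f"n_a = {n_a:.3e} m^-3, lambda_e = {lam_e * 1e9:.0f} nm")
```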


3.1.3 Diffraction Effects 


In a crystalline material, the regularity of the atomic arrangement requires that the
phase difference between waves scattered from each atom be taken into account by
introducing a structure factor $F(\theta)$ defined by

$$F(\theta) = \sum_j f_j(\theta) \exp(-i\,\mathbf{q}\cdot\mathbf{r}_j) \qquad (3.10)$$

Here, $\mathbf{r}_j$ and $f_j$ are the coordinate and scattering amplitude of atom $j$,
with $\mathbf{q}\cdot\mathbf{r}_j$ the associated phase factor; the sum is carried out
over all atoms ($j$ = 1, 2, etc.) in the unit cell. Equation (3.10) can also be
expressed in the form

$$F(\theta) \propto \int V(\mathbf{r}) \exp(-i\,\mathbf{q}\cdot\mathbf{r})\, d\tau \qquad (3.11)$$

where $V(\mathbf{r})$ is the scattering potential and the integration is carried out
over all volume elements within the unit cell. Equation (3.11) indicates that the
structure factor is related to the Fourier transform of the lattice potential.

The intensity scattered in a direction $\theta$ relative to the incident beam is
$|F(\theta)|^2$ and peaks at values of $\theta$ for which the scattered waves are in
phase with one another. Each diffraction maximum (Bragg beam) can also be regarded as
representing “reflection” from atomic planes, whose spacing $d$ depends on the Miller
indices and unit-cell dimensions. Bragg reflection occurs when the angle between
the incident beam and the diffracting planes coincides with a Bragg angle $\theta_B$
defined by

$$\lambda = 2d \sin\theta_B \qquad (3.12)$$

where $\lambda = 2\pi/k_0$ is the incident electron wavelength. The scattering angle
$\theta$ is twice $\theta_B$, so Eq. (3.12) is equivalent (for small $\theta_B$, large
$v$) to $\lambda = \theta d$. For 100-keV incident electrons, $\lambda$ = 3.7 pm and the
scattering angles corresponding to Bragg reflection exceed 10 mrad in simple materials.
Larger values of $\theta_B$ correspond to reflection from planes of smaller separation
or to higher order reflections whose phase difference is a multiple of $2\pi$.
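
The numbers quoted above can be reproduced with a short Python sketch of the
relativistic wavelength and Eq. (3.12). The function names are illustrative, and the
diamond (111) spacing of 0.206 nm is a standard crystallographic value supplied here as
an assumption, not a figure from the text.

```python
import math

H_C_EV_M = 1.239841984e-6  # h*c in eV m
M0C2_EV = 510998.95        # electron rest energy m0 c^2 in eV


def electron_wavelength(e0_ev):
    """Relativistic electron wavelength in metres for kinetic energy E0 (eV)."""
    return H_C_EV_M / math.sqrt(e0_ev * (e0_ev + 2.0 * M0C2_EV))


def bragg_angle(d_m, e0_ev):
    """Bragg angle theta_B (rad) from lambda = 2 d sin(theta_B), Eq. (3.12)."""
    return math.asin(electron_wavelength(e0_ev) / (2.0 * d_m))


if __name__ == "__main__":
    e0 = 100e3
    d_111 = 0.206e-9  # m, assumed diamond (111) spacing
    theta_b = bragg_angle(d_111, e0)
    print(f"lambda = {electron_wavelength(e0) * 1e12:.2f} pm")
    print(f"theta_B = {theta_b * 1e3:.2f} mrad, scattering angle = {2e3 * theta_b:.2f} mrad")
```

At these small angles the scattering angle $2\theta_B$ returned by the sketch is
practically equal to $\lambda/d$, which is the approximation noted in the text.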

The Bragg-reflected beams can be recorded by a two-dimensional detector such 
as a CCD array. For a single-crystal specimen, the diffraction pattern consists of an 
array of sharp spots whose symmetry and spacing are closely related to the crys- 
tal symmetry and lattice constants. In the case of a polycrystalline sample whose 
crystallite size is much less than the incident beam diameter, random rotational aver- 
aging produces a diffraction pattern consisting of a series of concentric rings, rather 
than a spot pattern. 

The relative intensities of the lowest order Bragg beams (for the case of a thin 
diamond specimen) are shown in Fig. 3.5 and are seen to follow the overall trend 
predicted by a single-atom model. Distributions of inelastic scattering from the inner 
(K-shell) and outer-shell (valence) electrons are also shown and are seen to be much 
narrower in angular range. In fact, each Bragg beam generates inelastic scattering 
within the specimen, so each Bragg spot in a diffraction pattern is surrounded by 
a halo of inelastic scattering. This inelastic scattering can be removed by zero-loss 
filtering of the diffraction pattern, which sharpens the Bragg spots and reduces the 
background between them. 

Fig. 3.5 Angular distribution of elastic scattering from a single C atom (solid curve)
and a diamond crystal (vertical bars) whose crystallographic axes are parallel to the
incident beam. For comparison, the angular distributions of inelastic scattering by
outer-shell and K-shell electrons are shown as Lorentzian functions corresponding to
energy losses of 35 and 300 eV, respectively.


In a crystalline solid, it is difficult to apply the concept of a mean free path for
elastic scattering; the intensity of each Bragg spot depends on the crystal orientation
relative to the incident beam and is not proportional to crystal thickness. Instead,
each reflection is characterized by an extinction distance $\xi_g$, which is typically
in the range of 25-100 nm for 100-keV electrons and low-order (small $\theta_B$)
reflections. In the ideal two-beam case, where Eq. (3.12) is approximately satisfied for
only one set of reflecting planes and where effects of inelastic scattering are
negligible, 50% of the incident intensity is diffracted at a crystal thickness of
$t = \xi_g/4$ and 100% at $t = \xi_g/2$. For $t > \xi_g/2$, the diffracted intensity
decreases with increasing thickness and would go to zero at $t = \xi_g$ (and multiples
of $\xi_g$) if inelastic scattering could be neglected. This oscillation of intensity
gives rise to the “thickness” or Pendellösung fringes that can be seen in TEM images.
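
The thickness dependence just described corresponds to the standard two-beam result in
which the diffracted fraction varies as $\sin^2(\pi t/\xi_g)$ at the exact Bragg
condition with absorption neglected. That functional form is not written out in the text
above; the Python sketch below assumes it, and the function name and the 50-nm
extinction distance are illustrative only.

```python
import math


def two_beam_diffracted_fraction(t, xi_g):
    """Diffracted fraction sin^2(pi t / xi_g): exact Bragg condition, no absorption."""
    return math.sin(math.pi * t / xi_g) ** 2


if __name__ == "__main__":
    xi_g = 50.0  # nm, hypothetical extinction distance
    for t in (0.0, xi_g / 4, xi_g / 2, 3 * xi_g / 4, xi_g):
        frac = two_beam_diffracted_fraction(t, xi_g)
        print(f"t = {t:5.1f} nm  diffracted = {frac:.3f}  direct = {1.0 - frac:.3f}")
```

The direct-beam fraction is taken as one minus the diffracted fraction, which holds only
while inelastic scattering is ignored.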


3.1.4 Electron Channeling 


Solution of the Schrödinger equation for an electron moving in a periodic potential
results in wavefunctions known as Bloch waves: plane waves whose amplitude is
modulated by the periodic lattice potential. Inside the crystal, a transmitted electron
is represented by the sum of a number of Bloch waves, each having the same
total energy. Because each Bloch wave propagates independently without change
of form, this representation can be preferable to describing electron propagation
in terms of direct and diffracted beams, whose relative amplitudes vary with the
depth of penetration. In the two-beam situation referred to in Section 3.1.3, there are
only two Bloch waves. The type-2 wave has its intensity maximum located halfway
between the planes of Bragg reflecting atoms and propagates parallel to these planes.
The type-1 wave propagates in the same direction but has its current density peaked
exactly on the atomic planes. Because of the attractive force of the atomic nuclei,
the type-1 wave has a more negative potential energy and therefore a higher kinetic
energy and larger wavevector than the type-2 wave. Because of this difference in
wavevector between the Bloch waves, their combined intensity exhibits a “beating”
effect, which provides an alternative but equivalent explanation for the occurrence
of thickness fringes in the TEM image.

The relative amplitudes of the Bloch waves depend on the crystal orientation. 
For the two-beam case, both amplitudes are equal at the Bragg condition, but if the 
crystal is tilted toward a “zone axis” orientation (the angle between the incident 
beam and the atomic planes being less than the Bragg angle) more intensity occurs 
in Bloch wave 1. Conversely, if the crystal is tilted in the opposite direction, Bloch
wave 2 becomes dominant. Away from the Bragg orientation the current density 
distributions of the Bloch waves become more uniform, so that they more nearly 
resemble plane waves. 

Besides having a larger wavevector, the type 1 Bloch wave has a greater proba- 
bility of being scattered by inelastic events that occur close to the center of an atom, 
such as inner-shell and phonon excitation (thermal-diffuse scattering). Electron 
microscopists refer to this inelastic scattering as absorption, meaning that the scat- 
tered electron is absorbed by the angle-limiting objective aperture that is commonly 
used in a TEM to enhance image contrast or limit lens aberrations. The effect is 
incorporated into diffraction-contrast theory by adding to the lattice potential an
imaginary component $V_0^i = \hbar v/(2\lambda_i)$, where $\lambda_i$ is an appropriate
inelastic mean free path. The variation of this “absorption” with crystal orientation is
called anomalous absorption and is characterized by an imaginary potential $V_g^i$. In
certain directions,
the crystal appears more “transparent” in a bright-field TEM image; in other direc- 
tions it is more opaque because of increased inelastic scattering outside the objective 
aperture. This behavior is analogous to the Borrmann effect in x-ray penetration 
and similar in many respects to the channeling of nuclear particles through solids. 
Anomalous absorption is also responsible for the Kikuchi bands that appear in the 
background to an electron-diffraction pattern (Kainuma, 1955; Hirsch et al., 1977). 

The orientation dependence of the Bloch-wave amplitudes also affects the inten- 
sity of inner-shell edges visible in the energy-loss spectrum. As the crystal is tilted 
through a Bragg orientation, an ionization edge can become either more or less 
prominent, depending upon the location (within the unit cell) of the atoms being 
ionized, relative to those that lie on the Bragg-reflecting planes (Taftø and Krivanek,
1982a). Inner-shell ionization is followed by de-excitation of the atom, involving the 
emission of Auger electrons or characteristic x-ray photons. So as a further result of 
the orientation dependence of absorption, the amount of x-ray emission varies with 
crystal orientation, provided the incident beam is sufficiently parallel (Hall, 1966; 
Cherns et al., 1973). This variation in x-ray signal is utilized in the ALCHEMI 
method of determining the crystallographic site of an emitting atom (Spence and 
Taftø, 1983).

In a more typical situation in which a number of Bragg beams are excited simultaneously,
there are an equally large number of Bloch waves, whose current density
distribution can be relatively complicated. The current density associated with each
Bloch wave has a two-dimensional distribution when viewed in the direction of
propagation; see Fig. 3.6.

Fig. 3.6 Amplitude of the fast-electron wavefunction (square root of current density)
for (a) a 100-keV STEM probe randomly placed at the {111} surface of a silicon crystal,
(b) these electrons after penetration to a depth of 9.4 nm, and (c) after penetration by
47 nm. Channeling has concentrated the electron flux along ⟨111⟩ rows of Si atoms. From
Loane et al. (1988), copyright International Union of Crystallography, available at
http://journals.iucr.org/

Decomposition of the total intensity of the beam-electron wavefunction into a 
sum of the intensities of separate Bloch waves is a useful approximation for chan- 
neling effects in thicker crystals. In thin specimens, however, this independent Bloch 
wave model fails because of interference between the Bloch waves, as observed 
by Cherns et al. (1973). For even thinner crystals ($t < \xi_g$) the current density is
nearly uniform over the unit cell and the incident electron can be approximated 
by a plane wave, allowing atomic theory to be used to describe inelastic processes 
(Section 3.6). 


3.1.5 Phonon Scattering 


Because of thermal (and zero-point) energy, atoms in a crystal vibrate about their 
lattice sites and this vibration acts as a source of electron scattering. An equiv- 
alent statement is that the transmitted electrons generate (and absorb) phonons 
while passing through the crystal. Since phonon energies are of the order of kT 
($k$ = Boltzmann’s constant, $T$ = absolute temperature) and do not exceed $kT_D$
($T_D$ = Debye temperature), the corresponding energy losses (and gains) are below
0.1 eV and are not resolved by the usual electron microscope spectrometer system.
There is, however, a wealth of structure in the vibrational-loss spectrum, which
has been observed using reflected low-energy electrons (Willis, 1980; Ibach and
Mills, 1982) and by high-resolution transmission spectroscopy (Geiger et al., 1970;
Fig. 1.9).

Except near the melting point, the amplitude of atomic vibration is small compared
to the interatomic spacing and (following the uncertainty principle) the
resulting scattering has a wide angular distribution. Particularly in the case of
heavier elements, phonon scattering provides a major contribution to the diffuse
background of an electron-diffraction pattern. This extra scattering occurs
at the expense of the purely elastic scattering; the intensity of each elastic
(Bragg-scattered) beam is reduced by the Debye-Waller factor $\exp(-2M)$, where
$M = 2(2\pi \sin\theta_B/\lambda)^2 \langle u^2 \rangle$, $\lambda$ is the electron
wavelength, and $\langle u^2 \rangle$ is the component of mean-square atomic
displacement in a direction perpendicular to the corresponding Bragg-reflecting planes.
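
A brief Python sketch of the Debye-Waller attenuation follows. It uses Bragg's law,
Eq. (3.12), to replace $\sin\theta_B/\lambda$ by $1/(2d)$, so that only the plane
spacing and the mean-square displacement are needed. The function name and the input
values are hypothetical, chosen only to show the order of magnitude.

```python
import math


def debye_waller_factor(d_spacing, mean_square_u):
    """Return exp(-2M), with M = 2 (2 pi sin(theta_B) / lambda)^2 <u^2>.

    Bragg's law gives sin(theta_B)/lambda = 1/(2 d), so M = 2 (pi / d)^2 <u^2>.
    d_spacing and sqrt(mean_square_u) must be in the same length unit.
    """
    m = 2.0 * (math.pi / d_spacing) ** 2 * mean_square_u
    return math.exp(-2.0 * m)


if __name__ == "__main__":
    d = 0.2      # nm, hypothetical plane spacing
    u2 = 1.0e-4  # nm^2, hypothetical <u^2> (rms displacement of 0.01 nm)
    print("Debye-Waller factor:", debye_waller_factor(d, u2))
```

Reflections from more closely spaced planes (smaller $d$) are attenuated more strongly,
since $M$ scales as $1/d^2$.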

The total phonon-scattered intensity (integrated over the entire diffraction plane)
is specified by an absorption coefficient $\mu_0 = 2V_0^i/(\hbar v)$, where $V_0^i$ is
the phonon contribution to the imaginary potential and $v$ is the electron velocity. An
inverse measure is the parameter $\xi_0'/2\pi = 1/\mu_0$, which is roughly equivalent to
a mean free path for phonon scattering. Typical values of $1/\mu_0$ are shown in
Table 3.1. Unlike other scattering processes, phonon scattering is appreciably
temperature dependent, increasing by a factor of 2-4 between 10 K and room temperature.
Like elastic scattering, it increases with the atomic number Z of the scattering atom,
roughly as $Z^{3/2}$; see Table 3.1.

Rez et al. (1977) calculated the phonon-loss intensity distribution in the diffrac- 
tion pattern of 100-keV electrons transmitted through 100-nm specimens of Al, Cu, 
and Au. Their results show that the phonon-loss intensity is sharply peaked around 
each Bragg beam (FWHM < 0.1 mrad) but with broad tails that overlap and con- 
tribute a background to the diffraction pattern, a situation quite similar to that arising 
from the inelastic scattering due to valence-shell excitation (Fig. 3.5). Because the 
energy transfer is a small fraction of an electron volt, phonon scattering is con- 
tained within the zero-loss peak and is not removed from TEM images or diffraction 
patterns by energy filtering. 

Inelastic scattering involving plasmon and single-electron excitation is also con- 
centrated at small angles around each Bragg beam. However, typically 20% or more 
occurs at scattering angles above 7 mrad (Egerton and Wong, 1995), so inelastic 
scattering from the atomic electrons also makes a substantial contribution to the dif- 
fuse background of an electron-diffraction pattern. Because it involves an average 
energy loss of some tens of electron volts, this electronic component is removed 
by energy filtering. Unlike phonon scattering, it varies only weakly with atomic 
number, and for light elements (Z < 13) it makes a larger contribution than phonon 
scattering to the diffraction pattern background (Eaglesham and Berger, 1994). 


Table 3.1 Values of phonon mean free path ($1/\mu_0$, in nm) calculated by Hall and
Hirsch (1965) for 100-keV electrons

T (K)    Al     Cu     Ag     Au     Pb
300      340    79     42     20     19
10       670    175    115    57     62

3.1.6 Energy Transfer in Elastic Scattering


We now revert to an atomic model of elastic scattering and consider an electron (rest
mass $m_0$) of kinetic energy $E_0$ deflected through an angle $\theta$ by the
electrostatic field of a stationary nucleus (mass $M$). Conservation of energy and
momentum requires that the electron must transfer to the nucleus an amount of energy
$E$ given by

$$E = E_{max} \sin^2(\theta/2) = E_{max}(1 - \cos\theta)/2 \qquad (3.12a)$$

$$E_{max} = 2E_0(E_0 + 2m_0c^2)/(Mc^2) \qquad (3.12b)$$

Here $E_{max}$ is the maximum possible energy transfer, corresponding to
$\theta = \pi$ rad. For a typical TEM incident energy (~100 keV) and the scattering
angles involved in electron diffraction and TEM imaging (< 0.1 rad), Eq. (3.12a) gives
$E$ < 0.1 eV; therefore, the energy loss is not measurable in a TEM-EELS system. If the
atom is part of a solid, this small amount of energy transfer can be used to generate
phonons.

In the case of a high-angle collision, $E$ approaches $E_{max}$, whose value depends on
$E_0$ and on the atomic weight (or atomic number) of the target atom but ranges from
a few electron volts for heavy atoms to several tens of electron volts for light atoms.
The energy loss associated with elastic backscattering ($\theta > \pi/2$ rad) of 40-keV
electrons can be measured and has been proposed as an analytical method (Went
and Vos, 2006).
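
Equations (3.12a) and (3.12b) are easily evaluated; the Python sketch below does so for
a few representative cases. The function names are illustrative, the nuclear mass is
approximated as the atomic weight times the atomic mass unit, and the constants are
standard values supplied here as assumptions.

```python
import math

M0C2_EV = 510998.95      # electron rest energy m0 c^2, in eV
AMU_C2_EV = 931494102.4  # atomic mass unit times c^2, in eV


def max_energy_transfer(e0_ev, mass_number):
    """E_max of Eq. (3.12b) in eV, taking M = A atomic mass units."""
    return 2.0 * e0_ev * (e0_ev + 2.0 * M0C2_EV) / (mass_number * AMU_C2_EV)


def energy_transfer(e0_ev, mass_number, theta):
    """Energy E of Eq. (3.12a) in eV for a scattering angle theta (rad)."""
    return max_energy_transfer(e0_ev, mass_number) * math.sin(theta / 2.0) ** 2


if __name__ == "__main__":
    e0 = 100e3  # eV
    for name, a in (("C", 12.011), ("Cu", 63.546), ("Au", 196.97)):
        print(f"{name}: E_max = {max_energy_transfer(e0, a):.2f} eV, "
              f"E(0.1 rad) = {energy_transfer(e0, a, 0.1):.4f} eV")
```

For 100-keV electrons the sketch gives roughly 20 eV for carbon and about 1 eV for gold
at $\theta = \pi$, while the transfer at 0.1 rad stays well below 0.1 eV.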

A high-angle collision is comparatively rare but if $E$ exceeds the displacement
energy of an atom (20 eV for elemental copper and 80 eV for diamond), it gives
rise to knock-on displacement damage in a crystalline specimen, atoms being displaced
from their lattice sites into interstitial positions to form vacancy-interstitial
(Frenkel) pairs; see, for example, Oen (1973) and Hobbs (1984). Alternatively, a
high-angle collision with a surface atom may transfer energy in excess of the
surface-displacement energy (generally below 10 eV), giving rise to electron-induced
sputtering (Bradley, 1988; Egerton et al., 2010). These processes occur only
if the incident energy exceeds a threshold value given by

$$E_0^{min} = m_0c^2\{[1 + (M/2m_0)(E_d/m_0c^2)]^{1/2} - 1\} = (511\ \mathrm{keV})\{[1 + AE_d/(561\ \mathrm{eV})]^{1/2} - 1\} \qquad (3.12c)$$

where $A$ is the atomic weight (mass number) of the target atom and $E_d$ is the bulk
displacement or surface-binding energy.
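
The threshold of Eq. (3.12c) follows from setting $E_{max}$ of Eq. (3.12b) equal to
$E_d$ and solving the resulting quadratic for $E_0$. The Python sketch below evaluates
it and checks the round trip; the function names are hypothetical and the displacement
energies are the ones quoted above for copper and diamond.

```python
import math

M0C2_EV = 510998.95      # electron rest energy m0 c^2, in eV
AMU_C2_EV = 931494102.4  # atomic mass unit times c^2, in eV


def max_energy_transfer(e0_ev, mass_number):
    """E_max of Eq. (3.12b) in eV, taking M = A atomic mass units."""
    return 2.0 * e0_ev * (e0_ev + 2.0 * M0C2_EV) / (mass_number * AMU_C2_EV)


def threshold_energy(mass_number, e_d_ev):
    """Threshold incident energy of Eq. (3.12c) in eV for displacement energy E_d."""
    x = mass_number * AMU_C2_EV * e_d_ev / (2.0 * M0C2_EV ** 2)
    return M0C2_EV * (math.sqrt(1.0 + x) - 1.0)


if __name__ == "__main__":
    for name, a, e_d in (("Cu", 63.546, 20.0), ("diamond", 12.011, 80.0)):
        e_th = threshold_energy(a, e_d)
        print(f"{name}: threshold = {e_th / 1e3:.0f} keV, "
              f"E_max at threshold = {max_energy_transfer(e_th, a):.2f} eV")
```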

At high scattering angles, the screening effect of the atomic electrons is small and
the differential cross section for elastic scattering is close to the Rutherford value,
Eq. (3.3). The Rutherford formula can be integrated between energy transfers $E_{max}$
and $E_{min}$ to yield a cross section:

$$\sigma_d = (0.250\ \mathrm{barn})\, F Z^2 [(E_{max}/E_{min}) - 1] \qquad (3.12d)$$

where 1 barn = $10^{-28}$ m² and $F = (1 - v^2/c^2)/(v^4/c^4)$ is a relativistic factor.
Using $E_{min} = E_d$ and $E_{max}$ given by Eq. (3.12b), a cross section for the
displacement process can be obtained from Eq. (3.12d). In the case of sputtering, the
specimen-thinning rate can then be estimated as $J_e\sigma_d$ monolayers per second for
a current density of $J_e$ electrons per area per second.
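
A minimal Python sketch of Eq. (3.12d) and the associated thinning-rate estimate is
given below. It is not the SIGDIS program; the function names are hypothetical, the
cross section is set to zero below threshold as an explicit choice, and the current
density in the demonstration is an arbitrary illustrative value.

```python
M0C2_EV = 510998.95      # electron rest energy m0 c^2, in eV
AMU_C2_EV = 931494102.4  # atomic mass unit times c^2, in eV
BARN_M2 = 1.0e-28        # one barn in m^2


def max_energy_transfer(e0_ev, mass_number):
    """E_max of Eq. (3.12b) in eV, taking M = A atomic mass units."""
    return 2.0 * e0_ev * (e0_ev + 2.0 * M0C2_EV) / (mass_number * AMU_C2_EV)


def rutherford_displacement_cross_section(z, mass_number, e0_ev, e_min_ev):
    """Cross section of Eq. (3.12d) in m^2; returns 0 if E_max does not exceed E_min."""
    e_max = max_energy_transfer(e0_ev, mass_number)
    if e_max <= e_min_ev:
        return 0.0
    beta_sq = 1.0 - 1.0 / (1.0 + e0_ev / M0C2_EV) ** 2  # (v/c)^2
    f = (1.0 - beta_sq) / beta_sq ** 2                   # relativistic factor F
    return 0.250 * BARN_M2 * f * z ** 2 * (e_max / e_min_ev - 1.0)


def thinning_rate(sigma_d_m2, current_density_per_m2_s):
    """Estimated sputtering rate J_e * sigma_d, in monolayers per second."""
    return current_density_per_m2_s * sigma_d_m2


if __name__ == "__main__":
    sigma = rutherford_displacement_cross_section(29, 63.546, 500e3, 20.0)
    print(f"Cu, 500 keV, E_d = 20 eV: sigma_d = {sigma / BARN_M2:.1f} barn")
    j_e = 1.0e25  # electrons per m^2 per second, hypothetical
    print("rate for this J_e (monolayers/s):", thinning_rate(sigma, j_e))
```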

In principle, greater accuracy is achieved by using Mott cross sections that
include the effects of electron spin. Such cross sections have been tabulated by Oen
(1973) and Bradley (1988). For lighter elements (Z < 28) an approximation due to
McKinley and Feshbach (1948) can be used: the Rutherford cross section is multiplied
by a correction factor and can be integrated analytically to give (Banhart, 1999)

$$\sigma_d = (0.250\ \mathrm{barn})[(1 - \beta^2)/\beta^4]\{X + 2\pi\alpha\beta X^{1/2} - [1 + 2\pi\alpha\beta + (\beta^2 + \pi\alpha\beta)\ln(X)]\} \qquad (3.12e)$$

where $\alpha = Z/137$, $\beta = v/c$, and
$X = E_{max}/E_{min} = \sin^2(\theta_{max}/2)/\sin^2(\theta_{min}/2)$.
For light elements, Eq. (3.12e) gives cross sections close to the true Mott cross
sections but for heavier elements (Z > 28) the values are too low and the Rutherford
formula gives a better approximation. A computer program SIGDIS that evaluates
Eqs. (3.12d) and (3.12e) is described in Appendix B.

In the case of electron-induced sputtering, it is possible that only the component
of transferred energy perpendicular to the surface is used to remove a surface
atom, which corresponds to a planar escape potential rather than a spherical one.
The Rutherford and McKinley-Feshbach-Mott cross sections are then obtained by
using $E_{min} = (E_d E_{max})^{1/2}$ rather than $E_{min} = E_d$ in Eq. (3.12d) or
(3.12e). A typical situation probably lies somewhere between these two extremes,
depending on the directionality of the atomic bonding; see, for example, Egerton et al.
(2010). The MATLAB program SIGDIS described in Appendix B calculates Rutherford and
Mott cross sections for both of these escape potentials.

Because Eq. (3.12a) relates each energy loss to a particular scattering angle $\theta$,
the integrated Rutherford cross section can alternatively be expressed in terms of
the minimum and maximum scattering angles involved:


$$\sigma_d = (0.250\ \text{barn})\,F Z^2\,[\sin^2(\theta_{\max}/2)/\sin^2(\theta_{\min}/2) - 1] \tag{3.12f}$$


The equivalent McKinley–Feshbach–Mott cross section is given by Eq. (3.12e) with
$X = \sin^2(\theta_{\max}/2)/\sin^2(\theta_{\min}/2)$. Such cross sections ignore screening of the atomic
nucleus and are valid only if $\theta_{\min}$ considerably exceeds some characteristic angle,
given by the Lenz model as $\theta_0 = Z^{1/3}/(k_0 a_0)$. For 60 keV electrons, $\theta_0$ = 27 mrad
for Z = 6 and 66 mrad for Z = 92. These angles are also large enough to ensure
that diffraction effects do not greatly affect the angle-integrated signal. A computer 
program SIGADF that evaluates og and ogp as a function of angle is described in 
Appendix B. 
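
The Lenz screening angles quoted above for 60 keV electrons can be reproduced with a few lines of Python. This is only a sketch: the helper names are hypothetical, and the physical constants are standard values entered by hand rather than taken from the text.

```python
import math

H = 6.62607015e-34        # Planck constant (J s)
C_LIGHT = 2.99792458e8    # speed of light (m/s)
Q_E = 1.602176634e-19     # elementary charge (C)
M0C2 = 510998.95          # electron rest energy (eV)
A0 = 0.529177e-10         # Bohr radius (m)

def wavenumber(E0):
    """Relativistic k0 = 2*pi/lambda (1/m) for kinetic energy E0 in eV."""
    p = math.sqrt(E0 * (E0 + 2.0 * M0C2)) * Q_E / C_LIGHT  # momentum (kg m/s)
    return 2.0 * math.pi * p / H

def lenz_theta0(Z, E0):
    """Characteristic screening angle theta_0 = Z^(1/3) / (k0 * a0), in rad."""
    return Z ** (1.0 / 3.0) / (wavenumber(E0) * A0)

for Z in (6, 92):
    print(f"Z = {Z}: theta_0 = {1e3 * lenz_theta0(Z, 60e3):.1f} mrad")
```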
The high-angle elastic cross sections relate directly to the signal ($I_d$ electrons/s)
received by a high-angle annular dark-field (HAADF) detector when an electron
beam ($I$ electrons/s) passes through a STEM specimen ($N$ atoms per unit area):


$$I_d = N I \sigma_d \tag{3.12g}$$


In the case of an atomic-sized electron probe, Eq. (3.12g) allows discrimination
between atoms of different atomic numbers, based on the HAADF signal (Krivanek
et al., 2010). For a large inner angle $\theta_{\min}$, the scattering is Rutherford-like and $I_d \propto Z^2$,
as seen in Fig. 3.4. For $\theta_{\min} = 0$ and large $\theta_{\max}$, $I_d \propto Z^{4/3}$ according to
the Lenz model, Eq. (3.8). For a typical HAADF detector ($\theta_{\min}$ = 60 mrad and
$\theta_{\max}$ = 200 mrad), $I_d \propto Z^{1.64}$ has been observed for Z < 12 (Krivanek et al., 2010).


3.2 Inelastic Scattering 


As discussed in Chapter 1, fast electrons are inelastically scattered by electrostatic 
interaction with both outer- or inner-shell atomic electrons, processes that predom- 
inate in different regions of the energy-loss spectrum. Before considering inelastic 
scattering mechanisms in detail, we deal briefly with theories that predict the total 
cross section for inelastic scattering by the atomic electrons. In light elements, outer- 
shell scattering makes the largest contribution to this cross section. For aluminum 
and 100-keV incident electrons, for example, inner shells represent less than 15% of 
the total-inelastic cross section, although they contribute over 50% to the stopping 
power and secondary electron production (Howie, 1995). 


3.2.1 Atomic Models 


For comparison with elastic scattering, we consider the angular dependence of
inelastic scattering (integrated over all energy loss) as expressed by the differential
cross section $d\sigma_i/d\Omega$. By modifying Morse's theory of elastic scattering, Lenz
(1954) obtained a differential cross section that can be written in the form (Reimer
and Kohl, 2008)


$$\frac{d\sigma_i}{d\Omega} = \frac{4\gamma^2 Z}{a_0^2 q^4}\left\{1 - \frac{1}{[1 + (q r_0)^2]^2}\right\} \tag{3.13}$$


where $\gamma^2 = (1 - v^2/c^2)^{-1}$ and $a_0 = 0.529 \times 10^{-10}$ m, the Bohr radius; $r_0$ is a
screening radius, defined by Eq. (3.4) for a Wentzel potential and equal to $a_0 Z^{-1/3}$
according to the Thomas–Fermi model. The magnitude $q$ of the scattering vector is
given approximately by the expression


$$q^2 = k_0^2(\theta^2 + \bar{\theta}_E^2) \tag{3.14}$$

in which $k_0 = 2\pi/\lambda = \gamma m_0 v/\hbar$ is the magnitude of the incident electron
wavevector, $\theta$ is the scattering angle, and $\bar{\theta}_E = \bar{E}/(\gamma m_0 v^2)$ is a characteristic angle
corresponding to an average energy loss $\bar{E}$. Comparison with Eq. (3.3) shows that
the first term ($4\gamma^2 Z/a_0^2 q^4$) in Eq. (3.13) is the Rutherford cross section for scattering
by Z atomic electrons, taking the latter to be stationary free particles. The remaining
term in Eq. (3.13) is an inelastic form factor (Schnatterly, 1979).

Equations (3.13) and (3.14) can be combined to give a more explicit expression 
for the angular dependence (Colliex and Mory, 1984): 


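$$\frac{d\sigma_i}{d\Omega} = \frac{4\gamma^2 Z}{a_0^2 k_0^4}\,\frac{1}{(\theta^2 + \bar{\theta}_E^2)^2}\left\{1 - \frac{\theta_0^4}{(\theta^2 + \bar{\theta}_E^2 + \theta_0^2)^2}\right\} \tag{3.15}$$


where $\theta_0 = (k_0 r_0)^{-1}$ as in the corresponding formula for elastic scattering, Eq. (3.5).
Taking $r_0 = a_0 Z^{-1/3}$ leads to the estimates $\bar{\theta}_E \approx 0.2$ mrad and $\theta_0 \approx 20$ mrad for a
carbon specimen, taking $E_0$ = 100 keV and $\bar{E} \approx 37$ eV (Isaacson, 1977).

In the angular range $\bar{\theta}_E < \theta < \theta_0$, which contains most of the scattering, $d\sigma_i/d\Omega$
is roughly proportional to $1/\theta^2$, whereas above $\theta_0$ it falls off as $1/\theta^4$; see Fig. 3.7.
The differential cross section therefore approximates to a Lorentzian function with
an angular width $\bar{\theta}_E$ and a gradual cutoff at $\theta = \theta_0$.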
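
The estimates above, and the change of slope from roughly 1/θ² to 1/θ⁴, can be checked numerically. The sketch below evaluates Eq. (3.15) for carbon at 100 keV with a mean energy loss of 37 eV; the function names are hypothetical and the constants are standard values supplied as assumptions.

```python
import math

H = 6.62607015e-34        # Planck constant (J s)
C_LIGHT = 2.99792458e8    # speed of light (m/s)
Q_E = 1.602176634e-19     # elementary charge (C)
M0C2 = 510998.95          # electron rest energy (eV)
A0 = 0.529177e-10         # Bohr radius (m)

def beam_parameters(E0):
    """Return (gamma, beta^2, k0 in 1/m) for kinetic energy E0 in eV."""
    gamma = 1.0 + E0 / M0C2
    beta2 = 1.0 - 1.0 / gamma**2
    p = math.sqrt(E0 * (E0 + 2.0 * M0C2)) * Q_E / C_LIGHT
    return gamma, beta2, 2.0 * math.pi * p / H

def lenz_angles(Z, E0, E_mean):
    """Characteristic angle theta_E and screening angle theta_0, both in rad."""
    _, beta2, k0 = beam_parameters(E0)
    theta_E = E_mean / ((E0 + M0C2) * beta2)   # E / (gamma * m0 * v^2)
    theta_0 = Z ** (1.0 / 3.0) / (k0 * A0)     # 1/(k0 r0) with r0 = a0 Z^(-1/3)
    return theta_E, theta_0

def lenz_inelastic_dcs(theta, Z, E0, E_mean):
    """Differential inelastic cross section of Eq. (3.15), in m^2 per steradian."""
    gamma, _, k0 = beam_parameters(E0)
    theta_E, theta_0 = lenz_angles(Z, E0, E_mean)
    u = theta**2 + theta_E**2
    prefactor = 4.0 * gamma**2 * Z / (A0**2 * k0**4)
    return prefactor / u**2 * (1.0 - theta_0**4 / (u + theta_0**2) ** 2)

def log_slope(theta_a, theta_b, Z, E0, E_mean):
    """Average logarithmic slope d(ln sigma)/d(ln theta) between two angles."""
    ya = lenz_inelastic_dcs(theta_a, Z, E0, E_mean)
    yb = lenz_inelastic_dcs(theta_b, Z, E0, E_mean)
    return math.log(yb / ya) / math.log(theta_b / theta_a)

theta_E, theta_0 = lenz_angles(6, 100e3, 37.0)
slope_mid = log_slope(2e-3, 4e-3, 6, 100e3, 37.0)      # between theta_E and theta_0
slope_high = log_slope(100e-3, 200e-3, 6, 100e3, 37.0)  # well above theta_0
print(f"theta_E = {1e3 * theta_E:.2f} mrad, theta_0 = {1e3 * theta_0:.1f} mrad")
print(f"log slopes: {slope_mid:.2f} (mid range), {slope_high:.2f} (high angle)")
```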

On the basis of Bethe theory (Section 3.2.2), cutoff would occur at a mean Bethe
ridge angle $\theta_r \approx (2\bar{\theta}_E)^{1/2}$. In fact, these two angles are often quite close to one
another; for carbon and 100-keV incident electrons, $\theta_0 \approx \theta_r \approx 20$ mrad. Using
this value as a cutoff angle in Eqs. (3.53) and (3.56), the mean and median angles of
inelastic scattering are estimated as $\bar{\theta} \approx 20\bar{\theta}_E \approx 4$ mrad and $\tilde{\theta} \approx 10\bar{\theta}_E \approx 2$ mrad for
carbon and 100-keV incident electrons, respectively. Inelastic scattering is therefore
concentrated into considerably smaller angles than elastic scattering, as seen from
Figs. 3.5 and 3.7.


Fig. 3.7 Angular dependence of the differential cross sections $d\sigma/d\Omega$ for elastic and
inelastic scattering of 100-keV electrons from a carbon atom, calculated using the Lenz
model (Eqs. (3.5), (3.7), and (3.15)) and plotted against scattering angle $\theta$ (mrad) on
logarithmic scales. Shown along the horizontal axis are (from left to right) the
characteristic, median, mean, root-mean-square, and effective cutoff angles for total
inelastic scattering, evaluated using Eqs. (3.53), (3.54), (3.55), and (3.56)
Integrating Eq. (3.15) up to a scattering angle $\beta$ gives the integral cross section:


$$\sigma_i(\beta) \approx \frac{8\pi\gamma^2 Z^{1/3}}{k_0^2}\,\ln\!\left[\frac{(\beta^2 + \bar{\theta}_E^2)(\theta_0^2 + \bar{\theta}_E^2)}{\bar{\theta}_E^2(\beta^2 + \theta_0^2 + \bar{\theta}_E^2)}\right] \tag{3.16}$$


Extending the integration to all scattering angles, the total inelastic cross section is

$$\sigma_i \approx (16\pi\gamma^2 Z^{1/3}/k_0^2)\,\ln(\theta_0/\bar{\theta}_E) \approx (8\pi\gamma^2 Z^{1/3}/k_0^2)\,\ln(2/\bar{\theta}_E) \tag{3.17}$$


where the Bethe ridge angle $(2\bar{\theta}_E)^{1/2}$ has been used as the effective cutoff angle $\theta_0$
(Colliex and Mory, 1984). Comparison of Eqs. (3.8) and (3.17) indicates that


$$\sigma_i/\sigma_e \approx 2\ln(2/\bar{\theta}_E)/Z = C/Z \tag{3.18}$$

where the coefficient C is only weakly dependent on atomic number Z and incident
energy $E_0$. Atomic calculations (Inokuti et al., 1981) suggest that (for Z < 40)
$\bar{E}$ varies by no more than a factor of 3 with atomic number; a typical value is
$\bar{E}$ = 40 eV, giving C ≈ 17 for 50-keV electrons and C ≈ 18 for 100-keV
electrons. Experimental measurements on solids agree surprisingly well with these
predictions; see Fig. 3.8. This simple Z-dependence of the inelastic/elastic scattering
ratio has been used to interpret STEM ratio images of very thin specimens; see
Section 2.6.6.

Fig. 3.8 Measured values of inelastic/elastic scattering ratio for 80-keV electrons, as
a function of atomic number of the specimen. Data points are from Egerton (1976a) and
Reimer and Ross-Messemer (1990). The solid line represents Eq. (3.18) with C = 20
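
The coefficient C of Eq. (3.18) is easy to evaluate directly. The following sketch (function names hypothetical; the electron rest energy is a standard value supplied as an assumption) computes the characteristic angle for a mean energy loss of 40 eV and then C = 2 ln(2/θ̄_E) at 50 and 100 keV.

```python
import math

M0C2 = 510998.95  # electron rest energy (eV)

def characteristic_angle(E_loss, E0):
    """theta_E = E / (gamma m0 v^2) = E / [(E0 + m0 c^2) beta^2], in rad."""
    gamma = 1.0 + E0 / M0C2
    beta2 = 1.0 - 1.0 / gamma**2
    return E_loss / ((E0 + M0C2) * beta2)

def lenz_ratio_coefficient(E_mean, E0):
    """Coefficient C in sigma_i / sigma_e = C / Z, Eq. (3.18)."""
    return 2.0 * math.log(2.0 / characteristic_angle(E_mean, E0))

for E0 in (50e3, 100e3):
    C = lenz_ratio_coefficient(40.0, E0)
    print(f"E0 = {E0 / 1e3:.0f} keV: C = {C:.1f}, sigma_i/sigma_e for carbon = {C / 6:.1f}")
```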

Measurements and more sophisticated atomic calculations of $\sigma_i$ differ from
Eq. (3.17) because of the variable number of outer-shell (valence) electrons, which 
contribute a large part of the scattering, especially in light elements. Instead of a 
simple power-law Z-dependence, the inelastic cross section reaches minimum val- 
ues for the compact, closed-shell (rare gas) atoms and maxima for the alkali metals 
and alkaline earths, where the weakly bound outer electrons contribute a substantial 
plasmon component; see Fig. 3.9. 


Closely related to the total inelastic cross section is the electron stopping power 
S, defined by (Inokuti, 1971) 


$$S = \frac{dE}{dz} = n_a \bar{E} \sigma_i \tag{3.19}$$


where $E$ represents energy loss, $z$ represents distance traveled through the specimen,
$\bar{E}$ is a mean energy loss per inelastic collision, and $n_a$ is the number of atoms per unit
volume of the specimen. Atomic calculations (Inokuti et al., 1981) show that $\bar{E}$ has
a periodic Z-dependence that largely compensates that of $\sigma_i$, giving $S$ a relatively
weak dependence on atomic number.

In low-Z elements, inner atomic shells contribute relatively little to $\sigma_i$ (Ritchie
and Howie, 1977), but they do have an appreciable influence on the stopping power
(Howie, 1995) because the energy losses involved are comparatively large. For
heavy elements, inner atomic shells make major contributions to both $\sigma_i$ and $S$.

Fig. 3.9 Total-inelastic cross section $\sigma_i$ and plasmon (outer-shell) cross section $\sigma_p$ for 200-keV
electrons, in Mbarn (cm$^2 \times 10^{-18}$) as a function of atomic number Z, based on EELS data of
Iakoubovskii et al. (2008b). The measurements include all scattering up to 20 mrad but none
beyond 40 mrad, so $\sigma_i$ will be too low for the heavier elements


3.2.2 Bethe Theory 


In order to describe in more detail the inelastic scattering of electrons by an atom 
(including the dependence of scattered intensity on energy loss), the behavior of 
each atomic electron is specified in terms of transition from an initial state of
wavefunction $\psi_0$ to a final state of wavefunction $\psi_n$. Using the first Born approximation,
the differential cross section for the transition is (Inokuti, 1971)


$$\frac{d\sigma_n}{d\Omega} = \left(\frac{m_0}{2\pi\hbar^2}\right)^2 \frac{k_1}{k_0}\left|\int V(\mathbf{r})\,\psi_0\,\psi_n^*\exp(i\mathbf{q}\cdot\mathbf{r})\,d\tau\right|^2 \tag{3.20}$$


In Eq. (3.20), $\mathbf{k}_0$ and $\mathbf{k}_1$ are wavevectors of the fast electron before and after scattering,
$\hbar\mathbf{q} = \hbar(\mathbf{k}_0 - \mathbf{k}_1)$ is the momentum transferred to the atom, $\mathbf{r}$ represents
the coordinate of the fast electron, $V(\mathbf{r})$ is the potential (energy) responsible for the
interaction, and the asterisk denotes complex conjugation of the wavefunction; the
integration is over all volume elements $d\tau$ within the atom.

Below an incident energy of about 300 keV (see Appendix A), the interaction 
potential that represents electrostatic forces between an incident electron and an 
atom can be written as 


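$$V(\mathbf{r}) = \frac{Ze^2}{4\pi\varepsilon_0 r} - \frac{1}{4\pi\varepsilon_0}\sum_j \frac{e^2}{|\mathbf{r} - \mathbf{r}_j|} \tag{3.21}$$


Although generally referred to as a potential, $V(\mathbf{r})$ is actually the negative of the
potential energy of an electron and is related to the electrostatic potential $\phi$ by
$V = e\phi$.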

The first term in Eq. (3.21) represents Coulomb attraction by the nucleus, 
charge = Ze; the second term is a sum of the repulsive effects from each atomic electron,
coordinate $\mathbf{r}_j$. Because the initial and final state wavefunctions are orthogonal,
the nuclear contribution integrates to zero in Eq. (3.20), so whereas the elastic cross 
section reflects both nuclear and electronic contributions to the potential, inelastic 
scattering involves only interaction with the atomic electrons. Because the latter are 
comparable in mass to the incident electron, inelastic scattering involves apprecia- 
ble energy transfer. Combining Eqs. (3.20) and (3.21), the differential cross section 
can be written in the form 


$$\frac{d\sigma_n}{d\Omega} = \left(\frac{4\gamma^2}{a_0^2 q^4}\right)\frac{k_1}{k_0}\,|\varepsilon_n(q)|^2 \tag{3.22}$$

where the first term in parentheses is the Rutherford cross section for scattering
from a single free electron, obtained by setting Z = 1 in Eq. (3.3). The second term
($k_1/k_0$) is very close to unity when the energy loss is much less than the incident
energy. The final term in Eq. (3.22), known as an inelastic form factor or dynamical
structure factor, is the square of the absolute value of a transition matrix element
defined by


$$\varepsilon_n = \int \psi_n^* \sum_j \exp(i\mathbf{q}\cdot\mathbf{r}_j)\,\psi_0\,d\tau = \Big\langle \psi_n \Big| \sum_j \exp(i\mathbf{q}\cdot\mathbf{r}_j) \Big| \psi_0 \Big\rangle \tag{3.23}$$


Like the elastic form factor of Eq. (3.2), $|\varepsilon_n(q)|^2$ is a dimensionless factor that
modifies the Rutherford scattering that would take place if the atomic electrons were 
free; it is a property of the target atom and is independent of the incident electron 
velocity. 

A closely related quantity is the generalized oscillator strength (GOS), given by
(Inokuti, 1971)


$$f_n(q) = \frac{E_n}{R}\,\frac{|\varepsilon_n(q)|^2}{(q a_0)^2} \tag{3.24}$$


where $R = (m_0 e^4/2)(4\pi\varepsilon_0\hbar)^{-2} = \hbar^2/(2m_0 a_0^2) = 13.6$ eV is the Rydberg energy
and $E_n$ is the energy change associated with the transition. The differential cross
section can therefore be written in the form


$$\frac{d\sigma_n}{d\Omega} = \frac{4\gamma^2 R}{E_n q^2}\,\frac{k_1}{k_0}\,f_n(q) \tag{3.25}$$


where $(k_1/k_0) \approx 1 - 2E_n(m_0 v^2)^{-1}$ can usually be taken as unity. In the limit $q \to 0$,
the generalized oscillator strength $f_n(q)$ reduces to the dipole oscillator strength $f_n$
that characterizes the response of an atom to incident photons (optical absorption).
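
The step from Eq. (3.22) to Eq. (3.25) is a one-line substitution; written out in the notation used above, it reads:

```latex
\begin{align*}
|\varepsilon_n(q)|^2 &= \frac{R}{E_n}\,(q a_0)^2 f_n(q)
  && \text{(rearranging Eq. 3.24)} \\
\frac{d\sigma_n}{d\Omega}
  &= \frac{4\gamma^2}{a_0^2 q^4}\,\frac{k_1}{k_0}\,\frac{R}{E_n}\,(q a_0)^2 f_n(q)
   = \frac{4\gamma^2 R}{E_n q^2}\,\frac{k_1}{k_0}\,f_n(q)
  && \text{(substituting into Eq. 3.22)}
\end{align*}
```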

In many cases (e.g., ionizing transitions to a “continuum” of states) the energy-loss
spectrum is a continuous rather than discrete function of the energy loss $E$,
making it more convenient to define a GOS per unit excitation energy (i.e., per unit
energy loss): $df(q, E)/dE$. The angular and energy dependence of scattering are
then specified by a double-differential cross section:


$$\frac{d^2\sigma}{d\Omega\,dE} = \frac{4\gamma^2 R}{E q^2}\,\frac{k_1}{k_0}\,\frac{df}{dE}(q, E) \tag{3.26}$$


To obtain explicitly the angular distribution of inelastic scattering, the scattering
vector $q$ must be related to the scattering angle $\theta$. For $\theta \ll 1$ rad and $E \ll E_0$, where
$E_0$ is the incident beam energy, it is a good approximation to take the components
of $q$ as $k_0\theta$ and $k_0\theta_E$ (see p. 192, Fig. 3.39) and to write


$$q^2 \approx k_0^2(\theta^2 + \theta_E^2) \tag{3.27}$$
where the characteristic angle $\theta_E$ is defined as


$$\theta_E = \frac{E}{\gamma m_0 v^2} = \frac{E}{(E_0 + m_0 c^2)(v/c)^2} \tag{3.28}$$


and $v$ is the speed of the incident electron. The factor $\gamma = m/m_0$ takes account of the
relativistic increase in mass of the incident electron. A nonrelativistic approximation
is $\theta_E = E/(2E_0)$, which gives characteristic angles 8, 14, and 18% too low at $E_0$ =
100, 200, and 300 keV, respectively. Equation (3.26) can now be written as


$$\frac{d^2\sigma}{d\Omega\,dE} \approx \frac{4\gamma^2 R}{E k_0^2}\left(\frac{1}{\theta^2 + \theta_E^2}\right)\frac{df}{dE} = \frac{8 a_0^2 R^2}{E m_0 v^2}\left(\frac{1}{\theta^2 + \theta_E^2}\right)\frac{df}{dE} \tag{3.29}$$


At low scattering angles, the main angular dependence in Eq. (3.29) comes from the
Lorentzian $(\theta^2 + \theta_E^2)^{-1}$ factor. The importance of the characteristic angle $\theta_E$ is that
it represents the half-width at half maximum (HWHM) of this Lorentzian function.
The regime of small scattering angle and relatively low energy loss, where $df/dE$ is
almost constant (independent of $q$ and $\theta$), is known as the dipole region.

For typical TEM incident energies (e.g., $E_0$ = 100 keV), the width of the
inelastic angular distribution is quite small (FWHM = $2\theta_E \approx 0.1$ mrad for outer-shell
excitation, typically a few milliradians for inner-shell excitation) and considerably
less than the angular width of elastic scattering; see Fig. 3.5. Consequently, inelastic
scattering broadens only slightly the diffraction spots or rings in the diffraction
pattern recorded from a crystalline specimen. However, the Lorentzian function has
long tails and half of the outer-shell excitation corresponds to angles greater than
about $10\,\theta_E$ (see Fig. 3.15), so inelastic scattering arising from electronic excitation
can contribute substantially to the background of an electron-diffraction pattern.
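
The size of the relativistic correction to the characteristic angle can be checked with a short calculation. The sketch below compares Eq. (3.28) with the nonrelativistic form E/(2E₀); the function names are hypothetical and the electron rest energy is a standard value supplied as an assumption.

```python
M0C2 = 510998.95  # electron rest energy (eV)

def theta_E_relativistic(E_loss, E0):
    """Eq. (3.28): theta_E = E / [(E0 + m0 c^2)(v/c)^2], in rad."""
    gamma = 1.0 + E0 / M0C2
    beta2 = 1.0 - 1.0 / gamma**2
    return E_loss / ((E0 + M0C2) * beta2)

def theta_E_nonrelativistic(E_loss, E0):
    """Nonrelativistic approximation theta_E = E / (2 E0), in rad."""
    return E_loss / (2.0 * E0)

def shortfall_percent(E0, E_loss=20.0):
    """Percentage by which the nonrelativistic angle falls below Eq. (3.28).

    The result does not depend on the chosen energy loss, which cancels in the ratio.
    """
    exact = theta_E_relativistic(E_loss, E0)
    return 100.0 * (1.0 - theta_E_nonrelativistic(E_loss, E0) / exact)

for E0 in (100e3, 200e3, 300e3):
    print(f"E0 = {E0 / 1e3:.0f} keV: nonrelativistic value {shortfall_percent(E0):.1f}% too low")
```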


3.2.3 Dielectric Formulation 


The equations given in Section 3.2.2 are most readily applied to single atoms or to 
gaseous targets, in the sense that the required generalized oscillator strength can be 
calculated (as a function of $q$ and $E$) using an atomic model. Even so, Bethe theory
is useful for describing the inelastic scattering that takes place in a solid, particu- 
larly from inner atomic shells. Outer-shell scattering is complicated by the fact that 
the valence-electron wavefunctions are modified by chemical bonding. In addition, 
collective effects are important, involving many atoms. An alternative approach is 
to describe the interaction of a transmitted electron with the entire solid in terms of 
a dielectric response function $\varepsilon(q, \omega)$. Although the latter can be calculated from
first principles in only a few idealized cases, the same response function describes 
the interaction of photons with a solid, so this formalism allows energy-loss data to 
be compared with the results of optical measurements. 

Ritchie (1957) derived an expression for the electron scattering power of an
infinite medium. The transmitted electron, having coordinate $\mathbf{r}$ and moving with a
velocity $\mathbf{v}$ in the z-direction, is represented as a point charge $-e\delta(\mathbf{r} - \mathbf{v}t)$ that generates
within the medium a spatially dependent, time-dependent electrostatic potential
$\phi(\mathbf{r}, t)$ satisfying Poisson's equation:


$$\varepsilon_0\,\varepsilon(q, \omega)\,\nabla^2\phi(\mathbf{r}, t) = e\,\delta(\mathbf{r}, t)$$


The stopping power ($dE/dz$) is equal to the backward force on the transmitted electron
in the direction of motion and is also the electronic charge multiplied by the
potential gradient in the z-direction. Using Fourier transforms, Ritchie showed that


$$\frac{dE}{dz} = \frac{2\hbar^2}{\pi a_0 m_0 v^2}\int\!\!\int \frac{q_y\,\omega\,\mathrm{Im}[-1/\varepsilon(q, \omega)]}{q_y^2 + (\omega/v)^2}\,dq_y\,d\omega \tag{3.30}$$


where the angular frequency $\omega$ is equivalent to $E/\hbar$ and $q_y$ is the component
of the scattering vector in a direction perpendicular to $\mathbf{v}$. The imaginary part of
$[-1/\varepsilon(q, \omega)]$ is known as the energy-loss function and provides a complete description
of the response of the medium through which the fast electron is traveling. The
stopping power can be related to the double-differential cross section (per atom) for
inelastic scattering by


$$\frac{dE}{dz} = \int\!\!\int n_a E\,\frac{d^2\sigma}{d\Omega\,dE}\,d\Omega\,dE \tag{3.31}$$

where $n_a$ represents the number of atoms per unit volume of the medium. For small
scattering angles, $dq_y \approx k_0\,d\theta$ and $d\Omega \approx 2\pi\theta\,d\theta$, so Eqs. (3.30) and (3.31) give


$$\frac{d^2\sigma}{d\Omega\,dE} \approx \frac{\mathrm{Im}[-1/\varepsilon(q, E)]}{\pi^2 a_0 m_0 v^2 n_a}\left(\frac{1}{\theta^2 + \theta_E^2}\right) \tag{3.32}$$

where $\theta_E = E/(\gamma m_0 v^2)$ is the characteristic angle, as before. Equation (3.32)
contains the same Lorentzian angular dependence and the same $v^{-2}$ factor as the
corresponding Bethe equation, Eq. (3.29). Comparison of these two equations indicates
that the Bethe and dielectric expressions are equivalent if


$$\frac{df}{dE}(q, E) = \frac{2E}{\pi E_a^2}\,\mathrm{Im}\!\left[\frac{-1}{\varepsilon(q, E)}\right] \tag{3.33}$$


where $E_a^2 = \hbar^2 n_a e^2/(\varepsilon_0 m_0)$, $E_a$ being a “plasmon energy” corresponding to one free
electron per atom (see Section 3.3.1).
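
The equivalence can be verified by equating the right-hand sides of Eqs. (3.29) and (3.32), in which the Lorentzian factor cancels, and then using $R = \hbar^2/(2m_0a_0^2)$ together with the standard expression $a_0 = 4\pi\varepsilon_0\hbar^2/(m_0e^2)$ for the Bohr radius (the latter is not stated in this section and is supplied here as an assumption):

```latex
\begin{align*}
\frac{8 a_0^2 R^2}{E\,m_0 v^2}\,\frac{df}{dE}
  &= \frac{\mathrm{Im}[-1/\varepsilon(q,E)]}{\pi^2 a_0 m_0 v^2 n_a}
  \quad\Longrightarrow\quad
  \frac{df}{dE} = \frac{E\,\mathrm{Im}[-1/\varepsilon(q,E)]}{8\pi^2 a_0^3 R^2 n_a}, \\[4pt]
8\pi^2 a_0^3 R^2 n_a
  &= 8\pi^2 a_0^3 n_a\,\frac{\hbar^4}{4 m_0^2 a_0^4}
   = \frac{2\pi^2\hbar^4 n_a}{m_0^2}\cdot\frac{m_0 e^2}{4\pi\varepsilon_0\hbar^2}
   = \frac{\pi}{2}\,\frac{\hbar^2 n_a e^2}{\varepsilon_0 m_0}
   = \frac{\pi}{2}\,E_a^2, \\[4pt]
\frac{df}{dE}
  &= \frac{2E}{\pi E_a^2}\,\mathrm{Im}\!\left[\frac{-1}{\varepsilon(q,E)}\right].
\end{align*}
```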

In the small-angle dipole region, $\varepsilon(q, E)$ varies little with $q$ and can be replaced
by the optical value $\varepsilon(0, E)$, which is the relative permittivity of the specimen at an
angular frequency $\omega = E/\hbar$. An energy-loss spectrum that has been recorded using
a reasonably small collection angle can therefore be compared directly with optical
data. Such a comparison involves a Kramers–Kronig transformation to obtain
$\mathrm{Re}[1/\varepsilon(0, E)]$, leading to the energy dependence of the real and imaginary parts
($\varepsilon_1$ and $\varepsilon_2$) of $\varepsilon(0, E)$, as discussed in Section 4.2. At large energy loss, $\varepsilon_2$ is small
and $\varepsilon_1$ close to 1, so that $\mathrm{Im}(-1/\varepsilon) = \varepsilon_2/(\varepsilon_1^2 + \varepsilon_2^2)$ becomes proportional to
$\varepsilon_2$ and (apart from a factor of $E^{-3}$) the energy-loss spectrum is proportional to the
x-ray absorption spectrum.

The optical permittivity is a transverse property of the medium, in the sense 
that the electric field of an electromagnetic wave displaces electrons in a direc- 
tion perpendicular to the direction of propagation, the electron density remaining 
unchanged. On the other hand, an incident electron produces a longitudinal dis- 
placement and a local variation of electron density. The transverse and longitudinal 
dielectric functions are precisely equal only in the random-phase approximation 
(see Section 3.3.1) or at sufficiently small $q$ (Nozieres and Pines, 1959); nevertheless,
there is no evidence for a significant difference between them, as indicated
by the close similarity of $\mathrm{Im}(-1/\varepsilon)$ obtained from both optical and energy-loss
measurements on a variety of materials (Daniels et al., 1970). 


3.2.4 Solid-State Effects 


If Bethe theory is applied to inelastic scattering in a solid specimen, one might 
expect the generalized oscillator strength (GOS) to differ from that calculated for 
a single atom, due to the effect of chemical bonding on the wavefunctions and the 
existence of collective excitations (Pines, 1963). These effects change mainly the 
energy dependence of df/dE and of the scattered intensity; the angular dependence 
of inelastic scattering remains Lorentzian (with half-width $\theta_E$), at least for small $E$
and small $\theta$.

Likewise, changes in the total inelastic cross section $\sigma_i$ are limited to a modest
factor (generally <3) because the GOS is constrained by the Bethe sum rule
(Bethe, 1930):


$$\int \frac{df}{dE}\,dE = z \tag{3.34}$$


The integral in Eq. (3.34) is over all energy loss $E$ (at constant $q$) and should be
taken to include a sum over excitations to “discrete” final states, which in many 
atoms make a substantial contribution to the total cross section. If the sum rule is 
applied to a whole atom, z is equal to the total number Z of atomic electrons and 
Eq. (3.34) is exact. If applied to a single atomic shell, z can be taken as the number 
of electrons in that shell, but the rule is only approximate, since the summation 
should include a (usually small) negative contribution from “downward” transitions 
to shells of higher binding energy, which are in practice forbidden by the Pauli 
exclusion principle (Pines, 1963; Schnatterly, 1979). 

Using Eq. (3.33), the Bethe sum rule can also be expressed in terms of the energy- 
loss function: 


$$\int \mathrm{Im}\!\left[\frac{-1}{\varepsilon(E)}\right] E\,dE = \frac{\pi\hbar^2 n e^2}{2\varepsilon_0 m_0} = \frac{\pi}{2}\,E_p^2 \tag{3.35}$$


where $E_p = \hbar(ne^2/\varepsilon_0 m_0)^{1/2}$ is a “plasmon energy” corresponding to the number of
electrons, $n$ per unit volume, that contribute to inelastic scattering within the range
of integration.

According to Eq. (3.29), the differential cross section (for $\theta \gg \theta_E$) is proportional
to $E^{-1}\,df/dE$, and $df/dE$ is constrained by Eq. (3.34); therefore, the cross
section $\sigma_i$ (integrated over all energy loss and scattering angle) must decrease if
contributions to the oscillator strength shift toward higher energy loss. This upward
shift applies to most solids because the collective excitation of valence electrons
generally involves energy losses that are higher than the average energy of atomic
valence-shell transitions. As a rough estimate of the latter, one might consider the
first ionization energy (Inokuti, personal communication); most often, the measured
valence-peak energy loss is above this value, as shown in Fig. 3.10, so the
valence-electron contribution to $\sigma_i$ is reduced when atoms form a solid. The resulting
decrease in $\sigma_i$ should be more marked for light elements where the valence shell
accounts for a larger fraction of the atomic electrons and therefore makes a larger
percentage contribution to the cross section (Inokuti et al., 1981). A fairly extreme
example is aluminum, where the inelastic cross section per atom is a factor of about
3 lower in the solid, in rough agreement with the ratio of the plasmon energy (15 eV)
and the atomic ionization threshold (6 eV).


Fig. 3.10 First ionization energy $E_i$ (open circles) and measured plasmon energy $E_p$ (filled circles)
as a function of atomic number. Where $E_p > E_i$, an atomic model is expected to overestimate the
total amount of inelastic scattering. Crosses represent the plasmon energy calculated on a
free-electron model with $m = m_0$
Some solid compounds show less pronounced collective effects and the inelastic
scattering from valence electrons retains much of its atomic character. As a rough 
approximation, each atom then makes an independent contribution to the scattering 
cross section. The effect of chemical bonding is to remove valence electrons from 
electropositive atoms (e.g., Na and Ca) and reduce their scattering power, whereas 
electronegative atoms (O, Cl, etc.) have their electron complement and scattering 
power increased. On this basis, the periodic component of the Z-dependence of $\sigma_i$
(Fig. 3.9), which is related to the occupancy of the outermost atomic shell, should be 
less in the case of solids. In any event, what is measured experimentally is the sum of 
the scattering from all atoms (anions and cations); if the reductions and increases in 
scattering power are equal in magnitude, the total scattering power is simply the sum 
of the scattering powers calculated on an atomic model. This additivity principle 
(when applied to the stopping power) is known as Bragg’s rule and is believed to 
hold to within ~5% accuracy (Zeiss et al., 1977) except for any contribution from 
hydrogen, which is usually small anyway. It provides some justification for the use 
of atomic cross sections to calculate the stopping power and range of electrons in 
solids (Berger and Seltzer, 1982). 

Fano (1960) suggested that the extent of collective effects depends on the value 
of the dimensionless parameter: 


$$u_F = \left(\frac{\hbar^2 n e^2}{\varepsilon_0 m_0}\right)\frac{df}{d(E^2)} = \left(\frac{\hbar^2 n e^2}{2\varepsilon_0 m_0}\right)\frac{1}{E}\,\frac{df}{dE} \tag{3.36}$$


where $n$ is the number of electrons per unit volume (with binding energies less
than $E$) that can contribute to the scattering at an energy $E$. Collective effects can
be neglected if $u_F \ll 1$, but are of importance where $u_F$ approaches or exceeds 1
within a particular region of the loss spectrum (Inokuti, 1979). Comparison with Eq.
(3.33) shows that $u_F = \mathrm{Im}[-1/\varepsilon(E)]/\pi$, so the criterion for neglecting collective
excitations becomes


$$\mathrm{Im}[-1/\varepsilon(E)] \ll \pi \tag{3.37}$$


Equation (3.37) provides a convenient criterion for assessing the importance of
collective effects, since any energy-loss spectrum that has been measured up to a
sufficiently high energy loss can be normalized, using Eq. (3.35) or (4.27), to give
the energy-loss function $\mathrm{Im}[-1/\varepsilon]$, as described in Section 4.2. A survey of experimental
data indicates that $\mathrm{Im}[-1/\varepsilon]$ rises to about 30 in Al, 3 to 4 for InSb, GaAs,
and GaSb (materials that support well-defined plasma oscillations), reaches 2.2 in
diamond, and does not rise much above 1 in the case of Cu, Ag, Pd, and Au, where
plasma oscillations are strongly damped (Daniels et al., 1970).

Organic solids are similar in the sense that their energy-loss function generally 
reaches values close to 1 for energy losses around 20 eV (Isaacson, 1972a), implying
that both atomic transitions and collective effects contribute to their low-loss
spectra. Aromatic compounds and those containing C=C double bonds also show
a sharp peak around 6–7 eV, sometimes interpreted as a plasmon resonance of the $\pi$
electrons. However, vapor-phase aromatic hydrocarbons give a similar peak, which
must therefore be interpreted in terms of $\pi \to \pi^*$ single-electron transitions (Koch
and Otto, 1969).

Ehrenreich and Philipp (1962) proposed more definitive criteria for the occur- 
rence of collective effects (plasma resonance), based on the energy dependence of 
the real and imaginary parts ($\varepsilon_1$ and $\varepsilon_2$) of the permittivity. According to these
criteria, liquids such as glycerol and water, as well as solids such as aluminum, silver,
silicon, and diamond, all respond in a way that is at least partly collective (Ritchie 
et al., 1989). 


3.3 Excitation of Outer-Shell Electrons 


Most of the inelastic collisions of a fast electron arise from interaction with elec- 
trons in outer atomic shells and result in an energy loss of less than 100 eV. In a 
solid, the major contribution comes from valence electrons (referred to as conduc- 
tion electrons in a metal), although in some materials (e.g., transition metals and 
their compounds) underlying shells of low binding energy contribute appreciable 
intensity in the 0-100 eV range. We begin this section by considering plasmon exci- 
tation, an important process in most solids and one that exhibits several features not 
predicted by atomic models. 


3.3.1 Volume Plasmons 


The valence electrons in a solid can be thought of as a set of coupled oscilla- 
tors that interact with each other and with a transmitted electron via electrostatic 
forces. In the simplest situation, the valence electrons behave almost as free parti- 
cles (although constrained by Fermi—Dirac statistics) and constitute a “free-electron 
gas,” also known as a “Fermi sea” or “jellium.” Interaction with the ion-core lattice 
is assumed to be a minor perturbation that can be incorporated phenomenologically 
by using an effective mass $m$ for the electrons, rather than their rest mass $m_0$, and by
introducing a damping constant $\Gamma$ or its reciprocal $\tau$, as in the Drude theory of electrical
conduction in metals. The behavior of the electron gas is described in terms of
a dielectric function, just as in Drude theory. In response to an applied electric field,
such as that produced by a transmitted charged particle, a collective oscillation of
the electron density occurs at a characteristic angular frequency $\omega_p$, and this resonant
motion would be self-sustaining if there were no damping from the atomic lattice.


3.3.1.1 Drude Model 


The displacement $x$ of a “quasi-free” electron (mass $m$) due to a local electric field
$E$ must satisfy the equation of motion:

$$m\ddot{x} + m\Gamma\dot{x} = -eE \tag{3.38}$$

For an oscillatory field, $E = E\exp(-i\omega t)$, Eq. (3.38) has a solution:

$$x = (eE/m)(\omega^2 + i\Gamma\omega)^{-1} \tag{3.39}$$

The displacement $x$ gives rise to a polarization $P = -enx = \varepsilon_0\chi E$, where $n$ is
the number of electrons per unit volume and $\chi$ is the electronic susceptibility, and
Eq. (3.39) leads to


$$\chi = \frac{-enx}{\varepsilon_0 E} = \frac{-ne^2}{\varepsilon_0 m}\,\frac{1}{\omega^2 + i\Gamma\omega} = -\frac{\omega_p^2}{\omega^2}\left(\frac{1 - i\Gamma/\omega}{1 + \Gamma^2/\omega^2}\right) \tag{3.39a}$$

The relative permittivity or dielectric function $\varepsilon(\omega)$ is then


$$\varepsilon(\omega) = \varepsilon_1 + i\varepsilon_2 = 1 + \chi = 1 - \frac{\omega_p^2}{\omega^2 + \Gamma^2} + \frac{i\Gamma\omega_p^2}{\omega(\omega^2 + \Gamma^2)} \tag{3.40}$$


Here $\omega$ is the angular frequency (rad/s) of forced oscillation and $\omega_p$ is the natural or
resonance frequency for plasma oscillation, given by


$$\omega_p = [ne^2/(\varepsilon_0 m)]^{1/2} \tag{3.41}$$


A transmitted electron represents a sudden impulse of applied electric field, containing
all angular frequencies (Fourier components). Setting up a plasma oscillation
of the loosely bound outer-shell electrons in a solid is equivalent to creating a
pseudoparticle of energy $E_p = \hbar\omega_p$, known as a plasmon (Pines, 1963).

Taking $m = m_0$ and writing electron density as $n = z\rho/(uA)$, where $z$ is the
number of free (valence) electrons per atom, $u$ is the atomic mass unit, $A$ represents
atomic weight, and $\rho$ is the specific gravity (density in g/cm$^3$) of the solid, the
free-electron plasmon energy is conveniently evaluated as


$$E_p = (28.82\ \text{eV})\,(z\rho/A)^{1/2} \tag{3.41a}$$

For a compound, $A$ becomes the molecular weight and $z$ the number of valence
electrons per molecule.
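
Equation (3.41a) lends itself to a quick numerical check. The sketch below evaluates it for a few elemental solids; the valence counts, densities, and atomic weights are handbook-style values supplied as assumptions rather than taken from the text, and the function name is hypothetical.

```python
import math

def plasmon_energy_eV(z, rho, A):
    """Free-electron plasmon energy from Eq. (3.41a); rho in g/cm^3, A in atomic mass units."""
    return 28.82 * math.sqrt(z * rho / A)

# (valence electrons per atom, density in g/cm^3, atomic weight): assumed input values
MATERIALS = {
    "Li": (1, 0.534, 6.94),
    "Be": (2, 1.85, 9.012),
    "Al": (3, 2.70, 26.98),
    "Si": (4, 2.33, 28.09),
    "K": (1, 0.862, 39.10),
}

energies = {name: plasmon_energy_eV(*params) for name, params in MATERIALS.items()}
for name, Ep in energies.items():
    print(f"{name}: E_p = {Ep:.1f} eV")
```

The results can be compared with the calculated column of Table 3.2; small differences reflect the choice of density.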
In this free-electron approximation, the energy-loss function is given by


$$\mathrm{Im}\!\left[\frac{-1}{\varepsilon(\omega)}\right] = \frac{\varepsilon_2}{\varepsilon_1^2 + \varepsilon_2^2} = \frac{\omega\Gamma\omega_p^2}{(\omega^2 - \omega_p^2)^2 + (\omega\Gamma)^2} \tag{3.42}$$


As shown by Eq. (3.32), $\mathrm{Im}(-1/\varepsilon)$ represents the energy dependence of the inelastic
intensity at an energy loss $E = \hbar\omega$ (the energy-loss spectrum), so Eq. (3.42) can be
written as

$$\mathrm{Im}\!\left[\frac{-1}{\varepsilon(E)}\right] = \frac{E_p^2(E\hbar/\tau)}{(E^2 - E_p^2)^2 + (E\hbar/\tau)^2} = \frac{E(\Delta E_p)E_p^2}{(E^2 - E_p^2)^2 + (E\,\Delta E_p)^2} \tag{3.43}$$


where $E_p$ is the plasmon energy and $\tau = 1/\Gamma$ is a relaxation time. The energy-loss
function $\mathrm{Im}(-1/\varepsilon)$ has a full width at half-maximum (FWHM) given by $\Delta E_p = \hbar\Gamma = \hbar/\tau$
and reaches a maximum value of $\omega_p\tau$ at an energy loss given by


$$E_{\max} = [E_p^2 - (\Delta E_p/2)^2]^{1/2} \tag{3.43a}$$


For a material such as aluminum, where the plasmon resonance is sharp ($\Delta E_p$ =
0.5 eV), the maximum is within 0.002 eV of $E_p$, but for the broad resonance found
in carbon, Eq. (3.43a) implies that $E_{\max}$ is shifted to lower energy by about 2.1 eV.
From Eq. (3.40), the energy at which $\varepsilon_1(E)$ passes through zero with positive slope
(see Fig. 3.11) is


$$E(\varepsilon_1 = 0) = [E_p^2 - (\Delta E_p)^2]^{1/2} \tag{3.43b}$$


This zero crossing is sometimes taken as evidence of a well-defined collective 
response in the solid under investigation. 
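
The Drude expressions above can be explored numerically. The following sketch (function names hypothetical) evaluates the energy-loss function of Eq. (3.43) for the parameters used in Fig. 3.11, locates its maximum on a fine grid, and compares the result with Eq. (3.43a); it also checks that the real part of the permittivity, Eq. (3.40) rewritten in terms of E = ħω, vanishes at the energy given by Eq. (3.43b).

```python
import math

def drude_loss(E, Ep, dEp):
    """Energy-loss function Im(-1/eps) of Eq. (3.43); all energies in eV."""
    return E * dEp * Ep**2 / ((E**2 - Ep**2) ** 2 + (E * dEp) ** 2)

def drude_eps1(E, Ep, dEp):
    """Real part of the permittivity, Eq. (3.40), with hbar*omega -> E and hbar*Gamma -> dEp."""
    return 1.0 - Ep**2 / (E**2 + dEp**2)

def peak_energy(Ep, dEp):
    """Approximate position of the loss-function maximum, Eq. (3.43a)."""
    return math.sqrt(Ep**2 - (dEp / 2.0) ** 2)

def zero_crossing(Ep, dEp):
    """Energy at which eps1 passes through zero, Eq. (3.43b)."""
    return math.sqrt(Ep**2 - dEp**2)

Ep, dEp = 15.0, 4.0                                # parameters quoted for Fig. 3.11
grid = [10.0 + 0.001 * i for i in range(10001)]    # 10 to 20 eV in 1 meV steps
E_numeric = max(grid, key=lambda E: drude_loss(E, Ep, dEp))

print(f"numerical maximum at {E_numeric:.3f} eV, Eq. (3.43a) gives {peak_energy(Ep, dEp):.3f} eV")
print(f"peak height {drude_loss(E_numeric, Ep, dEp):.2f}, E_p/dE_p = {Ep / dEp:.2f}")
print(f"eps1 = 0 at {zero_crossing(Ep, dEp):.2f} eV")
```

For a sharp resonance such as aluminum (ΔE_p = 0.5 eV), `Ep - peak_energy(15.0, 0.5)` gives the small downward shift mentioned in the text.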

Although based on a simplified model, Eq. (3.43) corresponds well with the
observed line shape of the valence-loss peak in materials with sharp plasmon peaks,
such as silicon and germanium (Hinz and Raether, 1979). Even with $m = m_0$,
Eq. (3.41) provides fairly accurate values for the energy of the main peak in the
energy-loss spectrum of many solids, taking $n$ as the density of outer-shell electrons;
see Fig. 3.12 and Table 3.2.

Fig. 3.11 Real and imaginary parts of the relative permittivity and the energy-loss function
$\mathrm{Im}(-1/\varepsilon)$, calculated using a free-electron (jellium) model with $E_p$ = 15 eV and
$\Delta E_p$ = 4 eV (Raether, 1980), copyright Springer-Verlag

Fig. 3.12 Energy of the main valence-loss peak, as measured by EELS (filled circles) and as
predicted by the free-electron formula, Eq. (3.41) with $m = m_0$ (hollow circles). Agreement is
good except for transition metals and rare earths, where d- and f-electrons contribute to ionization
edges with relatively low energy and high cross section

The relaxation time $\tau$ represents the time for plasma oscillations to decay in
amplitude by a factor exp(−1) = 0.37. The number of oscillations that occur within
this time is $\omega_p\tau/(2\pi) = 0.16\,(E_p/\Delta E_p)$. Using experimental values of $E_p$ and $\Delta E_p$,
this number turns out to be 4.6 for aluminum, 2.3 for sodium, 0.7 for silicon, and
0.4 for diamond, so the plasma oscillations tend to be highly damped, to a degree
that depends on the band structure of the material (see Section 3.3.2).
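
The oscillation count follows directly from $\Delta E_p = \hbar/\tau$. A minimal sketch, using the
experimental $E_p$ and $\Delta E_p$ values listed in Table 3.2 (the function name is hypothetical):

```python
import math

def oscillations_before_decay(ep_ev, dep_ev):
    """Plasma oscillations within one relaxation time tau.

    omega_p * tau / (2 pi) = Ep / (2 pi * dEp), because dEp = hbar / tau.
    """
    return ep_ev / (2 * math.pi * dep_ev)

# (Ep, dEp) in eV, taken from Table 3.2
for name, ep, dep in [("Al", 15.0, 0.53), ("Si", 16.5, 3.7)]:
    print(f"{name}: {oscillations_before_decay(ep, dep):.1f} oscillations")
```

The results are close to the 4.6 and 0.7 quoted above; the small difference for aluminum
reflects the exact width assumed.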

The double-differential cross section for plasmon scattering is obtained by
substituting Eq. (3.43) into Eq. (3.32), giving

$$\frac{d^2\sigma}{d\Omega\,dE} \approx \frac{1}{\pi^2 a_0 m_0 v^2 n_a}\left[\frac{E(\Delta E_p)E_p^2}{(E^2-E_p^2)^2+(E\,\Delta E_p)^2}\right]\frac{1}{\theta^2+\theta_E^2} \tag{3.43b}$$

This expression can be integrated over scattering angle to give an energy-differential
cross section for all scattering up to an angle $\beta$, on the assumption that the integration
remains within the dipole region ($\beta < \theta_c$, where $\theta_c$ is the plasmon cutoff angle
discussed below):

$$\frac{d\sigma(\beta)}{dE} \approx \frac{1}{\pi a_0 m_0 v^2 n_a}\left[\frac{E(\Delta E_p)E_p^2}{(E^2-E_p^2)^2+(E\,\Delta E_p)^2}\right]\ln(1+\beta^2/\theta_E^2) \tag{3.43c}$$

Table 3.2 Plasmon energy $E_p$ of several materials that show sharp peaks in the low-loss spectrum,
calculated using the free-electron formula and compared with measured values. The characteristic
angle $\theta_E$, cutoff angle $\theta_c$ [estimated from Eqs. (3.50) and (3.51)] and the plasmon mean free path
$\lambda_p$ [Eq. (3.58)] are for 100-keV incident electrons

Material   E_p (calc) (eV)   E_p (expt) (eV)   ΔE_p (eV)   θ_E (mrad)   θ_c (mrad)   λ_p (calc) (nm)
Li          8.0               7.1               2.3         0.039        5.3          233
Be         18.4              18.7               4.8         0.102        7.1          102
Al         15.8              15.0               0.53        0.082        7.7          119
Si         16.6              16.5               3.7         0.090        6.5          115
K           4.3               3.7               0.3         0.020        4.7          402

It is also useful to integrate Eqs. (3.43b) and (3.43c) over energy loss, but since
the variable $E$ occurs also in the $\theta_E^2$ term, a simple analytic expression is possible
only for zero damping ($\Delta E_p = 0$). In that case, the middle term in these equations
becomes a delta function: $(\pi/2)E_p\,\delta(E - E_p)$, giving

$$\frac{d\sigma}{d\Omega} \approx \frac{E_p}{2\pi a_0 m_0 v^2 n_a}\left(\frac{1}{\theta^2+\theta_{E_p}^2}\right) \tag{3.44}$$

$$\sigma_p(\beta) \approx \frac{E_p \ln(1+\beta^2/\theta_{E_p}^2)}{2 a_0 m_0 v^2 n_a} \tag{3.44a}$$

where $\theta_{E_p} = E_p/(\gamma m_0 v^2)$ or $\theta_{E_p} \approx E_p/2E_0$ within 10% accuracy for $E_0$ < 120 keV.
In the case of a broad plasmon peak, where $\Delta E_p \approx 0$ is not a good approximation,
the integration over energy loss must be done numerically. For the Drude model
E-dependence, Eq. (3.44a) is found to overestimate the cross section, by about 1%
for aluminum ($\Delta E_p \approx$ 0.5 eV) and about 6% for silicon ($\Delta E_p$ = 3.2 eV). Other
cases can be investigated using the DRUDE program, discussed in Appendix B.
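
To see how the exact characteristic angle compares with the approximation $E_p/2E_0$, the
sketch below evaluates both for a 15-eV loss at 100 keV. It is an illustration under
stated assumptions: the function names are hypothetical and the electron rest energy is
the standard value, which is not given in the text.

```python
M0C2_EV = 510_998.95  # electron rest energy in eV (assumed standard value)

def gamma_m0_v2_ev(e0_ev):
    """Return gamma * m0 * v^2 in eV for incident kinetic energy e0_ev."""
    gamma = 1.0 + e0_ev / M0C2_EV
    beta2 = 1.0 - 1.0 / gamma**2  # (v/c)^2
    return gamma * M0C2_EV * beta2

def theta_e_mrad(e_loss_ev, e0_ev):
    """Characteristic angle theta_E = E / (gamma m0 v^2), in mrad."""
    return 1e3 * e_loss_ev / gamma_m0_v2_ev(e0_ev)

e0 = 100e3
exact = theta_e_mrad(15.0, e0)
approx = 1e3 * 15.0 / (2 * e0)
print(f"exact  theta_E = {exact:.4f} mrad")
print(f"approx theta_E = {approx:.4f} mrad (ratio exact/approx = {exact / approx:.3f})")
```

At 100 keV the two values differ by a little under 10%, and the exact result matches the
0.082 mrad listed for aluminum in Table 3.2.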


3.3.1.2 The Plasmon Wake 


When a stationary charged particle is placed in a conducting medium, electrostatic
forces cause the electron density to readjust around the particle in a spherically
symmetric manner (screening by a "correlation hole"), reducing the extent of the
long-range Coulomb field and minimizing the potential energy. When the particle
is moving at a speed $v$, an additional effect occurs as illustrated in Fig. 3.13
(Echenique et al., 1979). Behind the particle, the potential and electron density
oscillate at the plasmon frequency ($\omega_p$ rad/s), corresponding to spatial oscillation
with a wavelength $\lambda_w = 2\pi v/\omega_p$. These oscillations also spread laterally, defining
a cone of semi-angle $\alpha \approx v_F/v$, where $v_F$ is the Fermi velocity in the medium. For a
100-keV electron, this cone is narrow ($\alpha$ < 1°), as indicated by the different length
scales for the radial and longitudinal distances in Fig. 3.13.

Garcia de Abajo and Echenique (1992) showed that formation and destruction
of the wake occur within distances of approximately $\lambda_w/4 = (\pi/2)(v/\omega_p)$ of the
entrance and exit surfaces of the specimen; the parameter $v/\omega_p$ is the same as Bohr's
delocalization distance $b_{\max}$; see Section 3.11. For $E_p$ = 15 eV and $E_0$ = 100 keV,
$v/\omega_p \approx$ 7.2 nm. Because of this "dead layer" beneath the surface, the probability
of bulk plasmon generation is reduced (the so-called begrenzungs effect), in compensation
for the surface plasmon excitation that occurs at each surface; see also
Fig. 3.25.

Fig. 3.13 (a) Response of medium (plasmon energy = 25 eV) to a moving charged particle,
depicted in terms of the scalar potential calculated by Echenique et al. (1979), with axes relabeled
to correspond to the case of a 100-keV electron. Oscillations along the z-axis (direction of electron
travel) represent the wake potential, which gives rise to plasmon excitation. (b) Corresponding
fractional change in electron density. This figure also reveals bow waves that start ahead of the
particle and extend laterally as a paraboloidal pattern; they arise from small impact parameter
collisions that generate single-electron excitations within the solid. For 100-keV electrons, the bow
waves would be more closely spaced than shown, since their wavelength scales inversely with
particle speed. Based on Echenique et al. (1979), copyright American Physical Society. Available at
http://link.aps.org/abstract/PRB/v20/p2567
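
The length scales of the plasmon wake can be checked with a few lines of arithmetic.
The sketch below is illustrative only; it assumes standard values of the electron rest
energy and of $\hbar c$ (neither is given in the text), and the function names are hypothetical.

```python
import math

M0C2_EV = 510_998.95    # electron rest energy in eV (assumed standard value)
HBAR_C_EV_NM = 197.327  # hbar * c in eV nm (assumed standard value)

def speed_over_c(e0_ev):
    """Relativistic v/c for kinetic energy e0_ev."""
    gamma = 1.0 + e0_ev / M0C2_EV
    return math.sqrt(1.0 - 1.0 / gamma**2)

def wake_lengths_nm(ep_ev, e0_ev):
    """Return (v/omega_p, wake wavelength, quarter wavelength) in nm."""
    v_over_wp = speed_over_c(e0_ev) * HBAR_C_EV_NM / ep_ev  # (v/c) * hbar c / Ep
    wavelength = 2 * math.pi * v_over_wp
    return v_over_wp, wavelength, wavelength / 4

b, lam, quarter = wake_lengths_nm(15.0, 100e3)
print(f"v/omega_p = {b:.1f} nm, lambda_w = {lam:.1f} nm, lambda_w/4 = {quarter:.1f} nm")
```

For $E_p$ = 15 eV and 100-keV electrons this reproduces $v/\omega_p \approx$ 7.2 nm, so the wake forms
over a depth of roughly 11 nm.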

Batson (1992b) pointed out that the charge-density and electric field fluctuations 
in the plasmon wake can excite electron transitions in the specimen, after the fast 
electron has passed, but that these single-electron excitations contribute to damping 
of the plasmon rather than additional energy loss. Batson and Bruley (1991) and 
Batson (1993c) have suggested that the form of the plasmon wake might account 
for small differences in fine structure between x-ray absorption and K-loss edges of
diamond and other insulators. 


3.3.1.3 Plasmon Dispersion: The Lindhard Model 


Equation (3.43) describes the energy dependence of the loss spectrum but applies
only to small scattering vectors $q$ (dipole region). The jellium model was first
extended to higher $q$ by Lindhard (1954), using the random-phase approximation
(Sturm, 1982) and assuming Fermi statistics, but neglecting spin exchange and correlation
effects arising from Coulomb interaction between the oscillating electrons.
The Lindhard model leads to analytical expressions for $\varepsilon(q, E)$ and $\mathrm{Im}(-1/\varepsilon)$ (Tung
and Ritchie, 1977; Schnatterly, 1979; Schattschneider and Jouffrey, 1994). In the
limit $\Gamma \to 0$, corresponding to completely free electrons, the plasmon energy $E_p(q)$
at which $\varepsilon_1$ passes through zero is given by the equations

$$E_p(q) = E_p + \alpha\,(\hbar^2/m_0)\,q^2 \tag{3.45}$$

$$\alpha = (3/5)\,E_F/E_p \tag{3.46}$$

where $E_F$ is the Fermi energy. Equation (3.45) is a dispersion relation for the
plasmon, $\alpha$ being the dispersion coefficient. The increase in plasmon energy with
increasing $q$ (i.e., increasing scattering angle) can be seen in Fig. 3.14, where the
plasmon peaks are represented by delta functions (vertical arrows) since damping
has been neglected.

Fig. 3.14 Energy-loss function $\mathrm{Im}(-1/\varepsilon)$ computed for silicon using the Lindhard model. Vertical
arrows represent the volume plasmons. The plasmon dispersion curve enters the region of
kinematically allowed single-electron excitation at point P, which defines the cutoff wavevector $q_c$. For
$q \gg q_c$, the single-electron peak is reduced in intensity and is known as the Bethe ridge. From
Walther and Cohen (1972), copyright American Physical Society. Available at
http://link.aps.org/abstract/PRB/v5/p3101

The Lindhard model can be extended to include plasmon damping (Mermin,
1970; Gibbons et al., 1976) and insulating materials whose electron distribution
is characterized by an energy gap (Levine and Louie, 1982). It is also possible to
avoid the random-phase approximation (RPA) and include electron correlation, as
first done by Nozières and Pines (1959), who obtained a dispersion relation similar
to Eq. (3.45) but with the dispersion coefficient given by

$$\alpha = \frac{3E_F}{5E_p}\left[1 - \left(\frac{E_p}{4E_F}\right)^2\right] \tag{3.47}$$

In the case of aluminum, $\alpha$ is reduced by 11% from its RPA value (0.45), giving
improved agreement with most measurements: for example, $\alpha$ = 0.38 ± 0.02
(Batson and Silcox, 1983).
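
The two dispersion coefficients can be compared numerically. The sketch below is an
illustration rather than the author's calculation: it assumes a free-electron Fermi
energy of 11.7 eV for aluminum (a standard textbook value not stated in the text) and
uses the calculated plasmon energy of 15.8 eV from Table 3.2.

```python
def alpha_rpa(e_fermi, e_p):
    """RPA (Lindhard) dispersion coefficient, Eq. (3.46)."""
    return 0.6 * e_fermi / e_p

def alpha_correlated(e_fermi, e_p):
    """Dispersion coefficient including correlation, Eq. (3.47)."""
    return alpha_rpa(e_fermi, e_p) * (1.0 - (e_p / (4.0 * e_fermi)) ** 2)

E_F_AL = 11.7  # eV, assumed free-electron Fermi energy of aluminum
E_P_AL = 15.8  # eV, Ep (calc) from Table 3.2

rpa = alpha_rpa(E_F_AL, E_P_AL)
corr = alpha_correlated(E_F_AL, E_P_AL)
print(f"alpha (RPA)        = {rpa:.3f}")
print(f"alpha (correlated) = {corr:.3f}")
print(f"reduction          = {100 * (1 - corr / rpa):.0f}%")
```

With these assumed inputs the RPA value is close to 0.45, the reduction is about 11%,
and the corrected coefficient falls within the quoted experimental range.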

The $q$-dependence is sometimes used to test the character of a valence-loss peak
(Crecelius et al., 1983). If the measured value of the dispersion coefficient is comparable
to the RPA value given by Eq. (3.46), collective behavior is suspected; if $\alpha$
is close to zero, an interband transition may be involved.

Unless the energy-loss spectrum is recorded using a sufficiently small collection
aperture (semi-angle $\ll \theta_E^{1/2}$), contributions from different values of $q$ cause a slight
broadening and upward shift of the plasmon peak.


3.3.1.4 Critical Wavevector 


Above a certain wavevector $q_c$, the plasma oscillations in a "free-electron gas" are
very heavily damped because it becomes possible for a plasmon to transfer all of
its energy to a single electron, which can then dissipate the energy by undergoing
an interband transition. Such an event must satisfy the usual conservation rules; if
an energy $E$ and momentum $\hbar\mathbf{q}$ are to be transferred to an electron of mass $m_0$
that initially had a momentum $\hbar\mathbf{q}_i$, conservation of both energy and momentum
requires

$$E = (\hbar^2/2m_0)(\mathbf{q} + \mathbf{q}_i)^2 - (\hbar^2/2m_0)\,q_i^2 = (\hbar^2/2m_0)(q^2 + 2\,\mathbf{q}\cdot\mathbf{q}_i) \tag{3.48}$$


The minimum value of $q$ that satisfies Eq. (3.48) corresponds to the situation where
$\mathbf{q}_i$ is parallel to $\mathbf{q}$ and as large as possible, namely, $q_i = q_F$, where $q_F$ is the Fermi
wavevector. Denoting this minimum value of $q$ as $q_c$ and substituting for $E = E_p(q)$
using Eq. (3.45) gives

$$E_p + \alpha\,(\hbar^2/m_0)\,q_c^2 = (\hbar^2/2m_0)(q_c^2 + 2 q_c q_F) \tag{3.49}$$

If the dispersion coefficient $\alpha$ is not greatly different from 0.5, the quadratic terms
on both sides of Eq. (3.49) almost cancel and to a rough approximation

$$q_c \approx m_0 E_p/(\hbar^2 q_F) = E_p/(\hbar v_F) \tag{3.50}$$


where $v_F$ is the Fermi velocity. Equation (3.50) is equivalent to $\omega_p/q \approx v_F$; in other
words, energy transfer becomes possible when the phase velocity of the plasmon
falls to a value close to the velocity of electrons at the Fermi surface. More precisely,
$q_c$ is defined by intersection of the curves representing Eqs. (3.45) and (3.48) with
$q_i = q_F$, indicated by the point P in Fig. 3.14.

A jellium model therefore predicts that inelastic scattering due to plasmon excitation
should fall abruptly to zero above a critical (or cutoff) angle $\theta_c$ that is related
to the critical wavevector $q_c$ by

$$q_c \approx k_0(\theta_c^2 + \theta_E^2)^{1/2} \approx k_0\theta_c \tag{3.51}$$

A more sophisticated calculation based on Hartree-Fock wavefunctions (Ferrell,
1957) predicts a gradual cutoff described by a function $G(q, q_c)$ that falls (from
unity) to zero at $q = 0.74\,q_F$, in somewhat better agreement with experimental
data; see Fig. 3.15. In fact, experimental evidence suggests that inelastic scattering is
partly collective in nature at wavevectors considerably above $q_c$ (Batson and Silcox,
1983).


Fig. 3.15 Angular dependence of the differential cross section $d\sigma/d\Omega$ for plasmon scattering
as calculated by Ferrell (solid curve) and using a sharp cutoff approximation (dashed line).
Experimental data of Schmüser (1964) for aluminum and 40-keV incident electrons are indicated
by the solid circles. Also shown are the characteristic, median, mean, and cutoff angles, calculated
using Eqs. (3.51), (3.52), (3.53), (3.54), (3.55), and (3.56)
3.3.1.5 Mean, Root-Mean-Square, and Median Scattering Angles


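The mean scattering angle $\bar\theta$ associated with plasmon scattering can be defined as

$$\bar\theta = \int \theta \left(\frac{d\sigma}{d\Omega}\right) d\Omega \Big/ \int \left(\frac{d\sigma}{d\Omega}\right) d\Omega = \int \theta \left(\frac{d\sigma}{d\theta}\right) d\theta \Big/ \int \left(\frac{d\sigma}{d\theta}\right) d\theta \tag{3.52}$$

where the integration is over all scattering angle $\theta$ or all solid angle $\Omega$. If the
differential cross section has a Lorentzian angular dependence with an abrupt cutoff at
$\theta = \theta_c$, the approximation $d\Omega = 2\pi(\sin\theta)\,d\theta \approx (2\pi\theta)\,d\theta$ (since $\theta_c \ll 1$) leads to

$$\bar\theta = \frac{\int_0^{\theta_c} \theta^2\,d\theta/(\theta^2+\theta_{E_p}^2)}{\int_0^{\theta_c} \theta\,d\theta/(\theta^2+\theta_{E_p}^2)} = \frac{\theta_c - \theta_{E_p}\arctan(\theta_c/\theta_{E_p})}{\tfrac{1}{2}\ln[1+(\theta_c/\theta_{E_p})^2]} \tag{3.53}$$

with $\theta_{E_p} = E_p/(\gamma m_0 v^2)$, as in Eq. (3.44). Similarly, a mean-square angle can be
evaluated as

$$\langle\theta^2\rangle = \theta_c^2/\ln(1+\theta_c^2/\theta_{E_p}^2) - \theta_{E_p}^2 \tag{3.54}$$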


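The root-mean-square angle $\theta$(rms) is the square root of $\langle\theta^2\rangle$ and is used in the
analysis of the angular distribution of multiple scattering.
A median scattering angle $\tilde\theta$ can also be defined, such that half of the scattering
occurs at angles less than $\tilde\theta$:

$$\int_0^{\tilde\theta} \frac{d\sigma}{d\Omega}\,d\Omega = \frac{1}{2}\int_0^{\theta_c} \frac{d\sigma}{d\Omega}\,d\Omega \tag{3.55}$$

Making a low-angle approximation, as above, gives

$$\tilde\theta = \theta_{E_p}(\theta_c/\theta_{E_p} - 1)^{1/2} \approx (\theta_{E_p}\theta_c)^{1/2} \tag{3.56}$$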


For 100-keV incident electrons, $\theta_c/\theta_{E_p} \approx v/v_F \approx 100$ in a typical material (see
Table 3.2), giving $\bar\theta \approx 22\,\theta_{E_p}$ and $\tilde\theta \approx 10\,\theta_{E_p}$. These average scattering angles are at
least an order of magnitude larger than $\theta_{E_p}$, reflecting the $2\pi\sin\theta$ weighting factor
that relates $d\Omega$ and $d\theta$ and the wide "tails" of the Lorentzian angular distribution,
compared to a Gaussian function of equal half-width.

Besides being the half-width of the differential cross section $d\sigma/d\Omega$, which
represents the amount of scattering per unit solid angle, $\theta_{E_p}$ is the most probable
scattering angle, corresponding to the maximum in $d\sigma/d\theta$; see Fig. 3.15.
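
The three average angles for a truncated Lorentzian can be evaluated directly from the
closed forms above. The following sketch is illustrative (the function names are
hypothetical); the median is obtained by solving Eq. (3.55) in the same small-angle
approximation, without the further simplification made in Eq. (3.56).

```python
import math

def mean_angle(theta_c, theta_e):
    """Mean scattering angle, Eq. (3.53)."""
    x = theta_c / theta_e
    return (theta_c - theta_e * math.atan(x)) / (0.5 * math.log(1 + x**2))

def rms_angle(theta_c, theta_e):
    """Root-mean-square angle, square root of Eq. (3.54)."""
    x = theta_c / theta_e
    return math.sqrt(theta_c**2 / math.log(1 + x**2) - theta_e**2)

def median_angle(theta_c, theta_e):
    """Median angle from ln(1 + t^2/theta_e^2) = 0.5 ln(1 + theta_c^2/theta_e^2)."""
    x = theta_c / theta_e
    return theta_e * math.sqrt(math.sqrt(1 + x**2) - 1)

# Angles in units of theta_E, with theta_c / theta_E = 100 as in the text
print(f"mean   = {mean_angle(100.0, 1.0):.1f} theta_E")
print(f"rms    = {rms_angle(100.0, 1.0):.1f} theta_E")
print(f"median = {median_angle(100.0, 1.0):.1f} theta_E")
```

For $\theta_c/\theta_{E_p}$ = 100 the mean comes out slightly above 21 and the median close to 10, in
line with the rounded values quoted above.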

Equations (3.53), (3.54), (3.55), and (3.56) apply to any scattering with a
Lorentzian angular distribution that terminates at a cutoff angle $\theta_c$. They are useful
approximations for single-electron excitation (including inner-shell ionization)
at an energy loss $E$, with $\theta_{E_p}$ replaced by the appropriate characteristic angle
$\theta_E = E/(\gamma m_0 v^2)$ and the Bethe ridge angle $\theta_r \approx (E/E_0)^{1/2} \approx (2\theta_E)^{1/2}$ used as
the cutoff angle; see Section 3.6.1. The same equations may also be applicable to
total-inelastic scattering (integrated over all energy loss), replacing $\theta_{E_p}$ by $\bar\theta_E$ and
setting $\theta_c \approx \theta_0 = 1/(k_0 r_0)$, the effective cutoff angle beyond which $d\sigma/d\Omega$ changes
from a $\theta^{-2}$ to a $\theta^{-4}$ angular dependence; see Section 3.2.1.


3.3.1.6 Plasmon Cross Section and Mean Free Path 


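Provided $\beta < \theta_c$, the free-electron approximation of Eq. (3.44a) gives the integral
cross section per atom (or per molecule) as follows:

$$\sigma_p(\beta) \approx \frac{E_p \ln(1+\beta^2/\theta_{E_p}^2)}{2 n_a a_0 m_0 v^2} \approx \frac{E_p \ln(\beta/\theta_{E_p})}{n_a a_0 m_0 v^2} \tag{3.57}$$

where $n_a$ is the number of atoms (or molecules) per unit volume and the approximation
in Eq. (3.57) applies to the case $\beta \gg \theta_{E_p}$. An inverse measure of the amount of
scattering below the angle $\beta$ is the mean free path $\lambda_p(\beta) = [n_a\sigma_p(\beta)]^{-1}$, given by

$$\lambda_p(\beta) \approx \frac{2 a_0 m_0 v^2}{E_p \ln(1+\beta^2/\theta_{E_p}^2)} \approx \frac{a_0 m_0 v^2}{E_p \ln(\beta/\theta_{E_p})} \tag{3.58}$$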


These free-electron formulas give reasonably accurate values for “free-electron” 
metals such as aluminum (Fig. 3.16a) but apply less well to transition metals 
(Fig. 3.9), where single-electron and core-level transitions considerably modify the 
energy-loss spectrum. 
Fig. 3.16 (a) Incident energy dependence of mean free path for valence excitation, as predicted
by free-electron theory, Eq. (3.58) with $\beta = \theta_c$ = 7.7 mrad, and as determined from EELS
measurements. Batson and Silcox (1983) give two values, the lower one including the plasmon peak
background due to single-electron transitions. (b) Collection-angle dependence of mean free path,
as predicted by Eq. (3.58) with $\theta_c$ = 8.5 mrad and from EELS measurements (Egerton, 1975)
including all energy losses up to 50 eV

Estimates of the total plasmon cross section and mean free path are obtained by
substituting $\beta = \theta_c$ in Eqs. (3.57) and (3.58), implying a sharp cutoff of intensity at
$\theta = \theta_c$. For 100-keV incident electrons, $\lambda_p$ is of the order of 100 nm (see Table 3.2).
In practice, single-electron excitation causes some inelastic scattering to
occur above $\theta_c$, so the measured inelastic mean free path decreases by typically 10 to 20%
between $\beta \approx$ 10 mrad (a typical plasmon cutoff angle) and $\beta \approx$ 150 mrad (the
maximum collection angle possible in a typical TEM, limited by the post-specimen
lenses); see Figs. 3.16b and 5.2d.
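
The mean free paths in Table 3.2 can be reproduced from Eq. (3.58) with $\beta = \theta_c$. The
sketch below is an illustration under stated assumptions: it uses standard values for the
electron rest energy and the Bohr radius (not given in the text), a hypothetical function
name, and the experimental $E_p$ and $\theta_c$ entries from Table 3.2.

```python
import math

M0C2_EV = 510_998.95  # electron rest energy in eV (assumed standard value)
A0_NM = 0.0529177     # Bohr radius in nm (assumed standard value)

def plasmon_mfp_nm(ep_ev, beta_mrad, e0_ev):
    """Free-electron plasmon mean free path, Eq. (3.58), in nm."""
    gamma = 1.0 + e0_ev / M0C2_EV
    m0v2 = M0C2_EV * (1.0 - 1.0 / gamma**2)  # m0 v^2 in eV
    theta_e = ep_ev / (gamma * m0v2)         # characteristic angle in rad
    beta = beta_mrad * 1e-3
    return 2 * A0_NM * m0v2 / (ep_ev * math.log(1 + (beta / theta_e) ** 2))

# (Ep expt in eV, cutoff angle in mrad) from Table 3.2, 100-keV electrons
for name, ep, theta_c in [("Li", 7.1, 5.3), ("Al", 15.0, 7.7), ("K", 3.7, 4.7)]:
    print(f"{name}: lambda_p = {plasmon_mfp_nm(ep, theta_c, 100e3):.0f} nm")
```

The computed values agree with the tabulated 233, 119, and 402 nm to within about 1 nm,
which suggests that the table was evaluated with the experimental plasmon energies.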


3.3.2 Single-Electron Excitation 


As discussed in the preceding section, the plasmon model accounts for the major 
features of the low-loss spectrum of materials such as Na, Al, and Mg where motion 
of the conduction electrons is relatively unaffected by the crystal lattice. The plasmon
peaks are particularly dramatic in the case of alkali metals, where $E_p$ falls
below the ionization threshold (Fig. 3.10), giving low plasmon damping.

In all materials, however, there exists an alternative mechanism of energy loss, 
involving the direct transfer of energy from a transmitted electron to a single atomic 
electron within the specimen. This second mechanism can be regarded as compet- 
ing with plasmon excitation in the sense that the total oscillator strength per atom 
must satisfy the Bethe sum rule, Eq. (3.34). The visible effects of single-electron 
excitation include the addition of fine structure to the energy-loss spectrum and a 
broadening and/or shift of the plasmon peak, as we now discuss. 


3.3.2.1 Free-Electron Model 


In Section 3.3.1, Eq. (3.48) referred to the transfer of energy from a plasmon to a
single atomic electron, but this same equation applies equally well to the case where
the energy $E$ is supplied directly from a fast electron. By inspecting Eq. (3.48) it can
be seen that, for a given value of $q$, the maximum energy transfer $E$(max) occurs
when $\mathbf{q}_i$ is parallel to $\mathbf{q}$ and as large as possible (i.e., $q_i = q_F$) so that

$$E(\max) = (\hbar^2/2m_0)(q^2 + 2 q q_F) \tag{3.59}$$

The minimum energy loss $E$(min) corresponds to the situation where $\mathbf{q}_i$ is antiparallel
to $\mathbf{q}$ and equal to $q_F$, giving

$$E(\min) = (\hbar^2/2m_0)(q^2 - 2 q q_F) \tag{3.60}$$

Within the region of $q$ and $E$ defined by Eqs. (3.59) and (3.60) (the shaded area in
Fig. 3.17), energy loss by single-electron excitation is kinematically allowed in the
free-electron approximation. The Lindhard model (Section 3.3.1) predicts the probability
of such transitions and shows (Fig. 3.14) that they occur within the expected
region, but mainly at higher values of $q$. At large $q$, $\mathrm{Im}[-1/\varepsilon]$ becomes peaked
around $E = (\hbar^2/2m_0)\,q^2$, as predicted by Bethe theory (see later, Fig. 3.36).
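
The kinematic window of Eqs. (3.59) and (3.60) can be tabulated for any wavevector. The
sketch below is illustrative only: the Fermi wavevector of 1.75 per angstrom is an assumed
aluminum-like value not given in the text, the function name is hypothetical, and clipping
the lower bound at zero (for $q < 2q_F$) is a choice made here because a negative energy
loss has no meaning.

```python
HBAR2_OVER_M0 = 7.62  # hbar^2 / m0 in eV * angstrom^2 (assumed standard value)

def single_electron_window(q, q_f):
    """Return (E_min, E_max) in eV from Eqs. (3.60) and (3.59).

    q and q_f are in inverse angstroms; E_min is clipped at zero.
    """
    e_max = 0.5 * HBAR2_OVER_M0 * (q**2 + 2 * q * q_f)
    e_min = 0.5 * HBAR2_OVER_M0 * (q**2 - 2 * q * q_f)
    return max(e_min, 0.0), e_max

Q_F = 1.75  # inverse angstroms, assumed aluminum-like Fermi wavevector

for q in (1.0, 4.0):
    e_min, e_max = single_electron_window(q, Q_F)
    print(f"q = {q:.1f} /A: allowed losses from {e_min:.1f} to {e_max:.1f} eV")
```

The width of the window, $E(\max) - E(\min) = 2(\hbar^2/m_0)\,q\,q_F$ once $q > 2q_F$, grows linearly
with $q$, which is why the shaded region in Fig. 3.17 fans out at large scattering vector.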
Fig. 3.17 Energy loss as a function of scattering vector, showing the region defined by
Eqs. (3.59) and (3.60), over which single-electron excitation is allowed according to the jellium
model. Also shown (dashed) is the plasmon dispersion curve $E_p(q)$. The horizontal arrow
indicates momentum transfer from the lattice, resulting in damping of the plasmon


In terms of particle concepts, large $q$ corresponds to a collision with small impact
parameter; if the incident electron passes sufficiently close to an atomic electron,
the latter can receive enough energy to be excited to a higher energy state (single-electron
transition). In contrast, atomic electrons further from the path of the fast
electron may respond collectively and share the transferred energy.

The relationship between collective and single-particle effects is further illustrated
in the plasmon pole (or single-mode) model developed by Ritchie and Howie
(1977). In their treatment, two additional terms occur in the denominator of Eq.
(3.43), resulting in a dispersion relation

$$E_q^2 = [E_p(q)]^2 = E_p^2 + (3/5)(q/q_c)^2 E_p^2 + \hbar^4 q^4/4m_0^2 \tag{3.61}$$

which for small $q$ reduces to the plasmon dispersion relation, Eqs. (3.45) and (3.46),
and at large $q$ to the energy-momentum relation ($E = \hbar^2 q^2/2m_0$) for an isolated
electron (dashed line in Fig. 3.17). An expression for the energy-loss function can
also be derived; neglecting plasmon damping, the differential cross section takes the
form (Ritchie and Howie, 1977)

$$\frac{d\sigma}{d\Omega} \approx \frac{m_0^2 e^2 E_p^2\,(v^2 - 2E_q/m_0)^{1/2}}{2\pi\hbar^4 v E_q q^2 n_a} \approx \frac{e^2 E_p^2}{2\pi\hbar^2 v^2 E_q n_a}\left(\frac{1}{\theta^2+\theta_q^2}\right) \tag{3.62}$$

with $\theta_q = E_q/(\gamma m_0 v^2)$. Equation (3.62) becomes equivalent to the plasmon formula,
Eq. (3.44), at small scattering angles. At large $q$, where the momentum is absorbed
mainly by a single electron that receives an energy much larger than its binding
energy, Eq. (3.62) becomes the Rutherford cross section for scattering from a free
electron: Eq. (3.3) with Z = 1.
3.3.2.2 The Effect of Band Structure


The free-electron plasmon model is often a good approximation (Figs. 3.12 and 
3.16), but in most materials the lattice has a significant effect on electron motion, 
as reflected in the band structure of the solid and the nonspherical form of the 
Fermi surface. Single-electron transitions can then occur outside the shaded region 
of Fig. 3.17, for example, at low q, the necessary momentum being supplied by the 
lattice. 

The transition rate is determined by details of the band structure. Where the 
single-electron component in a loss spectrum is high (e.g., transition metals) the 
E-dependence may exhibit a characteristic fine structure. In semiconductors and 
insulators, this structure reflects a joint density of states (JDOS) between the valence 
and conduction bands. Peaks in the JDOS occur where branches representing the 
initial and final states on the energy-momentum diagram are approximately parallel 
(Bell and Liang, 1976). 


3.3.2.3 Damping of Plasma Oscillations 


As remarked in Section 3.3.1, plasma resonance in solids is highly damped and
the main cause of this damping is believed to be the transfer of energy to single-electron
transitions (creation of electron-hole pairs). On a free-electron model, such
coupling satisfies the requirements of energy and momentum conservation only if
the magnitude of the plasmon wavevector $q$ exceeds the critical value $q_c$. In a real
solid, however, momentum can be supplied by the lattice (in units of a reciprocal
lattice vector, i.e., an Umklapp process) or by phonons, enabling the energy transfer
to occur at lower values of $q$, as indicated schematically by the horizontal arrow in
Fig. 3.17. The energy of a resulting electron-hole pair is eventually released as heat
(phonon production) or electromagnetic radiation (cathodoluminescence).

In fine-grained polycrystalline materials, grain boundaries may act as an additional
source of damping for low-$q$ (long-wavelength) plasmons, resulting in a
broadening of the energy-loss peak at small scattering angles (Festenberg, 1967).
As the diameter $d$ of silicon spheres decreased from 20 to 3.5 nm, Mitome et al.
(1992) observed a similar increase in width, presumably due to increased damping
by the surfaces. They also reported an increase in plasmon energy (from 16.7 to
17.5 eV) proportional to $1/d^2$. The authors argue that dispersion (plasmon wavelength
constrained to be less than the particle diameter) would cause an increase of
only 0.1 eV and that their measurements indicate a quantum size effect.


3.3.2.4 Shift of Plasmon Peaks 


Single-electron and plasmon processes are also linked in the sense that the
intensity observed in the energy-loss spectrum is not simply a sum of two independent
processes. Susceptibility $\chi$ is the additive quantity, not the energy-loss
function.

Drude theory can be extended to the case of electrons that are bound to the ion lattice
with a natural frequency of oscillation $\omega_b$, the equation of motion in an external
field $E$ becoming

$$m\ddot{x} + m\Gamma\dot{x} + m\omega_b^2 x = -eE \tag{3.63}$$

For a field varying as $\exp(-i\omega t)$, the solution is now $x = (eE/m)(\omega^2 + i\Gamma\omega - \omega_b^2)^{-1}$ and if $n$
is the electron density, the polarization has an amplitude $P = \varepsilon_0\chi E = -enx$, giving

$$\varepsilon = 1 + \chi = 1 + \frac{\omega_p^2}{\omega_b^2 - \omega^2 - i\omega\Gamma} \tag{3.64}$$

where $\omega_p = [ne^2/(m\varepsilon_0)]^{1/2}$ would be the plasma resonance frequency if the electrons
were free, whereas in fact the resonance peak is shifted to an angular frequency
$(\omega_p^2 + \omega_b^2)^{1/2}$. This situation applies to the valence electrons in a semiconductor or
insulator, where the binding energy $\hbar\omega_b$ is comparable with the energy gap $E_g$. The
energy $E_p^b$ of the resonance maximum is given by

$$(E_p^b)^2 \approx E_p^2 + E_g^2 \tag{3.65}$$

For most practical cases $E_g^2 \ll E_p^2$, which may explain why the valence-resonance
peak of semiconductors (and even insulators) is given fairly well by the free-electron
formula; see Table 3.3. The extended Drude model provides a fair approximation to
the energy-loss function of insulators such as alumina, although the behavior of the
real and imaginary parts of the permittivity is less well represented (Egerton, 2009).
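
The shift of the resonance described by Eq. (3.65) can be verified numerically from the
bound-electron permittivity of Eq. (3.64). The sketch below is illustrative only: the
parameter values (16 eV, 4 eV, and a damping of 1 eV) are hypothetical, all frequencies
are expressed as energies, and the peak is located by a simple grid search.

```python
import math

def bound_drude_eps(e, ep, eb, de):
    """Relative permittivity of Eq. (3.64), with frequencies written as energies (eV)."""
    return 1 + ep**2 / complex(eb**2 - e**2, -e * de)

def loss_function(e, ep, eb, de):
    """Energy-loss function Im(-1/eps)."""
    return (-1 / bound_drude_eps(e, ep, eb, de)).imag

def peak_position(ep, eb, de, lo=1.0, hi=40.0, step=0.01):
    """Energy of the loss-function maximum found on a uniform grid."""
    n = int(round((hi - lo) / step))
    grid = [lo + i * step for i in range(n + 1)]
    return max(grid, key=lambda e: loss_function(e, ep, eb, de))

EP, EB, DE = 16.0, 4.0, 1.0  # hypothetical illustrative values in eV
print(f"free electrons : peak at {peak_position(EP, 0.0, DE):.2f} eV")
print(f"bound electrons: peak at {peak_position(EP, EB, DE):.2f} eV")
print(f"Eq. (3.65)     : {math.sqrt(EP**2 + EB**2):.2f} eV")
```

Even with a binding energy equal to a quarter of $E_p$, the peak moves upward by only about
0.5 eV, which illustrates why the free-electron formula works fairly well for
semiconductors.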

If there are $n_f$ free electrons and $n_b$ bound electrons per unit volume,
$\varepsilon = 1 + \chi_f + \chi_b$, where $\chi_f$ is given by Eq. (3.39a) with $n = n_f$ and $\chi_b$ by Eq. (3.64). The
resonance energy $E_p^b$ is then given (Raether, 1980) by

$$(E_p^b)^2 \approx \frac{E_p^2}{1 + \chi_b(E_p)} \tag{3.66}$$

and can be raised or lowered, depending on the sign of $\chi_b$. If interband transitions
take place at an energy that is higher than $E_p$, Eq. (3.64) indicates that $\chi_b(E_p)$ is
positive. Polarization of the bound electrons reduces the restoring force on the displaced
free electrons, reducing the resonance energy below the free-electron value.
Conversely, if interband transitions occur at a lower energy, the plasmon peak is
shifted to higher energy, as in Fig. 3.18.

Table 3.3 Experimental energy $E_p$ (expt) of the main peak in the energy-loss spectra of several
semiconductors and insulators, compared with the plasmon-resonance energy $E_p$ (calc) given by
Eq. (3.41), where $n$ is the number of outer-shell (valence) electrons per unit volume. Values in eV,
from Raether (1980)

Material   E_p (expt)   E_p (calc)
Diamond    34           31
Si         16.5         16.6
Ge         16.0         15.6
InSb       12.9         12.7
GaAs       15.8         15.7
NaCl       15.5         15.7

Fig. 3.18 (a) Dielectric properties of a free-electron gas with $E_p$ = 16 eV and $\Delta E_p$ = 4 eV.
(b) Drude model with interband transitions added at $E_b$ = 10 eV (Daniels et al., 1970)

In a semiconductor, $n_f$ corresponds to a relatively low density of conduction
electrons. Even for $n_f \approx 10^{19}$ cm$^{-3}$, the corresponding resonance energy is only
about 0.25 eV, too small to be easily observable by EELS, and with negligible
influence on the resonance of the valence electrons. At high doping levels, however,
the electron concentration in an n-type semiconductor may become high enough to
give a second plasma resonance peak that is detectable in a high-resolution TEM-EELS
system. A 1.8-eV peak in the spectrum of LaB$_6$ has been attributed to this
cause, the valence-electron resonance being at 19 eV (Sato et al., 2008a). Graphite
and graphitic nanotubes can be doped with metals to introduce conduction electrons,
resulting in a plasmon peak below 2 eV, in addition to the $\pi$-resonance peak around
6 eV and the ($\pi + \sigma$) resonance around 27 eV (Liu et al., 2003).

In the case of a metal, interband transitions can cause a second resonance if the
value of $n_b$ is sufficiently large. For energies just below $E_b$, $\varepsilon_1(E)$ is forced positive
(see Fig. 3.18) and therefore crosses zero with positive slope (a condition for
a plasma resonance) at two different energies. Such behavior is observed in silver,
resulting in a (highly damped) resonance at 3.8 eV as well as the "free-electron"
resonance at 6.5 eV (see Fig. 3.19). In the case of copper and gold, interband transitions
lead to a similar fluctuation in $\varepsilon_1$ (Fig. 3.19), but they occur at too low an
energy to cause $\varepsilon_1$ to cross zero and there is only one resonance point, at 5 eV in
gold.

The Drude model is used in connection with electrical conduction in metals, the
dc conductivity being given by

$$\sigma(0) = \varepsilon_0\tau\Omega_p^2 = \tau n_f e^2/m_0 \tag{3.66a}$$

Here $\Omega_p$ is a plasmon frequency related to the quasi-free electrons (density $n_f$)
that can undergo intraband transitions within the conduction band, and therefore
contribute to conductivity. In the case of aluminum, $\hbar\Omega_p \approx$ 12.5 eV, the intraband
and interband contributions to the oscillator strength being $n_f$ = 1.9 and
$n_b$ = 1.2 electrons/atom, according to Smith and Segall (1986).

Fig. 3.19 Energy-loss function $\mathrm{Im}(-1/\varepsilon)$ and real part $\varepsilon_1$ of the dielectric function, derived from
energy-loss spectroscopy of polycrystalline films of silver, gold, and copper. From Daniels et al.
(1970), copyright Springer

Layer crystals such as graphite and boron nitride have much weaker bonding
in the c-axis direction than within the basal (cleavage) plane. Each carbon atom
in graphite has one $\pi$-electron and three $\sigma$-electrons, the latter responsible for
the strong in-plane bonding. If isolated, these two groups of electrons might have
free-electron resonance energies of 12.6 and 22 eV, but coupling between the two
oscillating systems forces the resonance peaks apart; the observed peaks in the loss
spectrum occur at about 7 and 27 eV (Liang and Cundy, 1969).

In anisotropic materials the dielectric function $\varepsilon(\mathbf{q}, E)$ is actually a tensor $\varepsilon_{ij}$ and
the peak structure in the energy-loss spectrum depends on the direction of the scattering
vector $\mathbf{q}$. For a uniaxial crystal such as graphite, axes can be chosen such that
off-diagonal components are zero, in which case $\varepsilon(\mathbf{q}, E) = \varepsilon_\perp \sin^2\Theta + \varepsilon_\parallel \cos^2\Theta$,
where $\varepsilon_\perp = \varepsilon_{11} = \varepsilon_{22}$ and $\varepsilon_\parallel = \varepsilon_{33}$ are components of $\varepsilon(E)$ perpendicular and
parallel to the c-axis; $\Theta$ is the angle between $\mathbf{q}$ and the c-axis, which depends on the
scattering angle and the specimen orientation. If the c-axis is parallel to the incident
beam, the greatest contribution (for $\beta \gg \theta_E$) comes from perpendicular excitations
and two plasmon peaks are observed. Under special conditions (e.g., small
collection angle $\beta$), the $\mathbf{q} \parallel c$ excitations predominate and the higher energy peak is
displaced downward in energy. Further details about EELS of anisotropic materials
are given in Daniels et al. (1970) and Browning et al. (1991b).


3.3.3 Excitons 


In insulators and semiconductors, it is possible to excite electrons from the valence 
band to a Rydberg series of states lying just below the bottom of the conduction 
band, resulting in an energy loss E; given by 


E, = Eg — Ep/n* (3.67) 


where Eg is the energy gap, Ep is the exciton binding energy, and nis an integer. The 
resulting excitation can be regarded as an electron and a valence band hole, bound 
to each other to form a quasiparticle known as the exciton. 

Although the majority of cases fall between the two extremes, two basic types
of exciton can be distinguished. In the Wannier (or Mott) exciton, the electron-hole pair is
weakly bound ($E_b < 1$ eV) and the radius of the “orbiting” electron is larger than
the interatomic spacing. The radius $r$ and binding energy can be estimated using a
hydrogenic formula: $E_b = e^2/(8\pi\varepsilon\varepsilon_0 r) = m_0 e^4/(8\varepsilon^2\varepsilon_0^2 h^2 n^2)$, where $\varepsilon$ is the relative permittivity at the orbiting frequency (usually the light-optical value). The
electron and hole may travel together through the lattice with the absorbed momentum $\hbar q$. Such excitons exist in high-permittivity semiconductors (e.g., Cu2O and
CdS) but a high-resolution spectrometer system is needed to detect the associated
energy losses.
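As a rough numerical illustration of the hydrogenic estimate above, the following sketch evaluates the binding energy and orbit radius for a chosen relative permittivity. The function names and the sample permittivity are illustrative assumptions, not values taken from the text; the free-electron mass $m_0$ is used, as in the formula quoted above.

```python
import math

# CODATA constants (SI units)
E_CHARGE = 1.602176634e-19   # elementary charge, C
M0 = 9.1093837015e-31        # electron rest mass, kg
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
H = 6.62607015e-34           # Planck constant, J s


def wannier_binding_energy_ev(eps_r, n=1):
    """Hydrogenic binding energy m0 e^4 / (8 eps^2 eps0^2 h^2 n^2), in eV."""
    joules = M0 * E_CHARGE**4 / (8 * eps_r**2 * EPS0**2 * H**2 * n**2)
    return joules / E_CHARGE


def wannier_radius_nm(eps_r, n=1):
    """Orbit radius r obtained from E_b = e^2 / (8 pi eps eps0 r), in nm."""
    e_b = wannier_binding_energy_ev(eps_r, n) * E_CHARGE
    return E_CHARGE**2 / (8 * math.pi * eps_r * EPS0 * e_b) * 1e9


if __name__ == "__main__":
    eps_r = 7.0  # illustrative permittivity (assumption, not from the text)
    print(f"E_b = {wannier_binding_energy_ev(eps_r):.3f} eV")
    print(f"r   = {wannier_radius_nm(eps_r):.3f} nm")
```

With $\varepsilon = 1$ and $n = 1$ the functions reduce to the hydrogen-atom values (13.6 eV and the Bohr radius), which gives a quick sanity check; a larger permittivity lowers the binding energy as $1/\varepsilon^2$ and enlarges the orbit in proportion to $\varepsilon$.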

Frenkel excitons are strongly bound and relatively compact, the radius of the 
electron orbit being less than the interatomic spacing. These are essentially excited 
states of a single atom and in some solids may be mobile via a hopping mechanism. 
For the alkali halides, where there is probably some Wannier and some Frenkel 
character, the binding energy amounts to several electron volts, tending to be lower 
on the anion site. 

Energy-loss peaks due to transitions to exciton states are observed adjacent to 
the “plasmon” resonance peak in alkali halides (Creuzburg, 1966; Daniels et al., 
1970), in rare gas solids (Daniels and Krüger, 1971), and in molecular crystals such
as anthracene. The peaks can be labeled according to the transition point in the 
Brillouin zone and often show a doublet structure arising from spin-orbit splitting 
(Fig. 3.20). 

The energy $E_x$ of an exciton peak should obey a dispersion relation:


$$E_x(q) = E_x(0) + \hbar^2 q^2/2m^* \qquad (3.68)$$


where $m^*$ is the effective mass of the exciton. However, measurements on alkali
halides show dispersions of less than 1 eV (Creuzburg, 1966) suggesting that $m^* \gg m_0$ (Raether, 1980).

Fig. 3.20 Energy-loss function for KBr, calculated from EELS measurements (full line) and from
optical data (broken line). The peak at about 13.5 eV is believed to represent a plasma resonance of
the valence electrons but the other peaks arise from excitons. From Daniels et al. (1970), copyright
Springer


3.3.4 Radiation Losses


If the velocity $v$ of an electron exceeds (for a particular frequency) the speed of light
in the material through which it is moving, the electron loses energy by emitting
Cerenkov radiation at that frequency. The photon velocity can be written as $c/n = c/\sqrt{\varepsilon_1}$, where $n$ and $\varepsilon_1$ are the refractive index and relative permittivity, respectively,
of the medium, so the Cerenkov condition is satisfied when


$$\varepsilon_1(E) > c^2/v^2 \qquad (3.69)$$


In an insulator, $\varepsilon_1$ is positive at low photon energies and may considerably exceed 1.
In diamond, for example, $\varepsilon_1 > 6$ for 3 eV < $E$ < 10 eV, so Cerenkov
radiation is generated by electrons whose incident energy is 50 keV or higher,
resulting in a “radiation peak” in the corresponding range of the energy-loss spectrum (see Fig. 3.21a). The photons are emitted in a hollow cone of semi-angle
$\phi = \cos^{-1}(c\,v^{-1}\varepsilon_1^{-1/2})$ but are detected only if the specimen is tilted to avoid total
internal reflection.
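The 50-keV figure quoted for diamond follows directly from Eq. (3.69). The sketch below (function names are illustrative assumptions) converts a value of $\varepsilon_1$ into the threshold kinetic energy using relativistic kinematics, and also evaluates the cone semi-angle from the expression above.

```python
import math

M0C2_KEV = 510.99895  # electron rest energy, keV


def beta_squared(e0_kev):
    """v^2/c^2 for an electron of kinetic energy e0_kev (relativistic)."""
    gamma = 1.0 + e0_kev / M0C2_KEV
    return 1.0 - 1.0 / gamma**2


def cerenkov_threshold_kev(eps1):
    """Lowest kinetic energy for which eps1 > c^2/v^2, Eq. (3.69)."""
    if eps1 <= 1.0:
        raise ValueError("no Cerenkov emission for eps1 <= 1")
    gamma = 1.0 / math.sqrt(1.0 - 1.0 / eps1)
    return (gamma - 1.0) * M0C2_KEV


def cone_semi_angle_deg(e0_kev, eps1):
    """Emission semi-angle arccos(c / (v sqrt(eps1))), in degrees."""
    cos_phi = 1.0 / math.sqrt(beta_squared(e0_kev) * eps1)
    if cos_phi > 1.0:
        raise ValueError("below the Cerenkov threshold")
    return math.degrees(math.acos(cos_phi))


if __name__ == "__main__":
    print(f"threshold for eps1 = 6: {cerenkov_threshold_kev(6.0):.1f} keV")
    print(f"cone semi-angle at 55 keV: {cone_semi_angle_deg(55.0, 6.0):.1f} deg")
```

For $\varepsilon_1 = 6$ the threshold comes out just under 50 keV, consistent with the statement above; larger permittivities lower the threshold further.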

Kröger (1968) developed relativistic formulas that include the retardation effects
responsible for Cerenkov emission. For relatively thick specimens, where internal
reflection of photons and surface plasmon excitation can be neglected, the double-differential cross section becomes (Festenberg and Kröger, 1968)

Fig. 3.21 (a) Solid line: low-loss spectrum of a 262-nm diamond sample, recorded using 55-keV
electrons (Festenberg, 1969). The dashed curve is the intensity derived from the relativistic theory
of Kröger (1968), while the dotted curve is calculated without taking into account retardation.
(b) Calculated angular dependence of the radiation-loss intensity for a 210-nm GaP specimen, 50-keV incident electrons, and four values of energy loss. From Festenberg (1969), copyright Springer

(3.70)


The Lorentzian angular term of Eq. (3.32) is here replaced by a more complicated function whose “resonance” denominator decreases to a small value (for
small $\varepsilon_2$) at an angle $\theta_p = \theta_E(\varepsilon_1 v^2/c^2 - 1)^{1/2}$. As a result, the angular distribution of inelastic scattering peaks sharply at small angles (<0.1 mrad). The calculated
peak position and width (as a function of energy loss; see Fig. 3.21b) are in broad
agreement with experiment (Chen et al., 1975) but for thin specimens a more complicated formula, Eq. (3.84), must be used. Because the value of $\theta_p$ is so small, the
radiation-loss electrons pass through an on-axis collection aperture of typical size
and can dominate the energy-loss spectrum at low energies, contributing additional
fine structure that interferes with bandgap measurements (Stöger-Pollach et al.,
2006).

Equation (3.70) shows that retardation effects cause the angular distribution to
depart from the Lorentzian form within the energy range for which $\varepsilon_1 v^2/c^2 > 0.5$
(Festenberg and Kröger, 1968), a less restrictive condition than Eq. (3.69). This condition is also fulfilled at relatively high energy loss (where $\varepsilon_1 \approx 1$ in both conductors
and insulators) if $v^2/c^2$ exceeds approximately 0.5, leading again to deviation from
a Lorentzian angular distribution when the incident energy is greater than about
200 keV; see Appendix A.

Energy is also lost by radiation when an electron crosses a boundary where the 
relative permittivity changes. This transition radiation results not from the change 
of velocity but from change in the electric field strength surrounding the electron 
(Frank, 1966; Garcia de Abajo, 2010). Polarized photons are emitted with energies 
up to approximately $0.5\,\hbar\omega(1 - v^2/c^2)^{-1/2}$ (Garibyan, 1960), but the probability
of this process appears to be of the order of 0.1%. 

As a result of Cerenkov and transition losses, the electron energy-loss spectrum 
below 5 eV (where nonretarded losses are small) can provide a direct measure of 
the optical density of states (Garcia de Abajo et al., 2003). Although this connec- 
tion may not hold at all planes in a structure (Hohenester et al., 2009), the EELS 
measurement involves integration along the beam direction and good agreement 
has been obtained between experimental results and calculated ODOS (Cha et al., 
2010). The advantage of using an electron beam over optical excitation is the possi- 
bility of nanometer-scale resolution, which offers the option of examining repetitive 
nanostructures of limited dimensions or those containing defects. 

It is possible for an electron to gain energy at a surface illuminated by photons, 
and electron energy-gain spectroscopy (EEGS) has been proposed as a method of 
investigating nanostructures, combining high spatial and energy resolution (Garcia 
de Abajo and Kociak, 2008). Energy gains of 200-keV electrons at the surface of 
a carbon nanotube have been reported, together with the possibility of mapping 
the electric field around nanostructures on a femtosecond timescale (Barwick et al., 
2009). 


3.3.5 Surface Plasmons


Analogous to the bulk or volume plasmons that propagate inside a solid, there are
longitudinal waves of charge density that travel along an external surface or an
internal interface, namely, surface or interface plasmons. The electrostatic potential at a planar surface is of the form $\cos(qx - \omega t)\exp(-q|z|)$, where $q$ and $\omega$ are the
wavevector and angular frequency of oscillation and $t$ represents time. The surface
charge density is proportional to $\cos(qx - \omega t)\,\delta(z)$, and continuity of the electric field
leads to the requirement:


$$\varepsilon_a(\omega) + \varepsilon_b(\omega) = 0 \qquad (3.71)$$


where $\varepsilon_a$ and $\varepsilon_b$ are the relative permittivities on either side of the boundary.
Equation (3.71) defines the angular frequency $\omega_s$ of the surface plasmon. It
can be satisfied at the interface with a metal because the real part of the
permittivity becomes negative at low frequencies; see Fig. 3.19.


3.3.5.1 Free-Electron Approximation 


The simplest situation corresponds to a single vacuum/metal interface where the
metal has negligible damping ($\Gamma \to 0$). Then $\varepsilon_a = 1$ and $\varepsilon_b \approx 1 - \omega_p^2/\omega^2$, where
$\omega_p = E_p/\hbar$ is the bulk plasmon frequency in the metal. Substitution into Eq. (3.71)
gives the energy $E_s$ of the surface plasmon peak in the energy-loss spectrum:


$$E_s = \hbar\omega_s = \hbar\omega_p/\sqrt{2} = E_p/\sqrt{2} \qquad (3.72)$$


A more general case is a dielectric/metal boundary where the permittivity of the
dielectric has a positive real part $\varepsilon_1$ and a much smaller imaginary part $\varepsilon_2$ for frequencies close to $\omega_s$. Again assuming negligible damping in the metal, Eq. (3.72)
becomes


$$E_s = E_p/(1 + \varepsilon_1)^{1/2} \qquad (3.73)$$

and the energy width of the resonance peak is

$$\Delta E_s = \hbar/\tau = E_s\,\varepsilon_2\,(1 + \varepsilon_1)^{-1} \qquad (3.74)$$


Stern and Ferrell (1960) have calculated that a rather thin oxide coating (typically
4 nm) is sufficient to lower $E_s$ from $E_p/\sqrt{2}$ to the value given by Eq. (3.73), and
experiments done under conditions of controlled oxidation support this conclusion
(Powell and Swan, 1960).
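A short numerical sketch of the free-electron expressions for the surface plasmon energy and its width may help fix the orders of magnitude. The function names are illustrative assumptions; the bulk plasmon energy of 15 eV and $\varepsilon_1 = 3$ are the values quoted elsewhere in this section for aluminum and for a typical oxide.

```python
import math


def surface_plasmon_energy(e_p, eps1=1.0):
    """E_s = E_p / sqrt(1 + eps1); eps1 = 1 is the vacuum/metal case."""
    return e_p / math.sqrt(1.0 + eps1)


def surface_plasmon_width(e_p, eps1, eps2):
    """Delta E_s = E_s * eps2 / (1 + eps1) for a weakly absorbing dielectric."""
    return surface_plasmon_energy(e_p, eps1) * eps2 / (1.0 + eps1)


if __name__ == "__main__":
    e_p = 15.0  # bulk plasmon energy in eV
    print(f"clean surface:    {surface_plasmon_energy(e_p):.2f} eV")
    print(f"oxide, eps1 = 3:  {surface_plasmon_energy(e_p, 3.0):.2f} eV")
    # eps2 = 0.1 is an arbitrary illustrative value, not taken from the text
    print(f"width, eps2 = 0.1: {surface_plasmon_width(e_p, 3.0, 0.1):.3f} eV")
```

The oxide-covered case gives a surface peak slightly above 7 eV, well below the clean-surface value of about 10.6 eV.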

Equation (3.73) illustrates the fact that surface-loss peaks occur at a lower
energy loss than their volume counterparts, usually below 10 eV. In the case of
an interface between two metals (bulk plasmon energies $E_a$ and $E_b$), Eq. (3.71) leads to a surface plasmon energy
$[(E_a^2 + E_b^2)/2]^{1/2}$ but in this situation, Jewsbury and Summerside (1980) have
argued that the excitation may not be confined to the boundary region.




The intensity of surface plasmon scattering is characterized not by a differential cross section (per atom of the specimen) but by a differential “probability” of
scattering per unit solid angle, given for a free-electron metal (Stern and Ferrell,
1960) by


$$\frac{dP_s}{d\Omega} = \frac{\hbar}{\pi a_0 m_0 v}\left(\frac{2}{1+\varepsilon_1}\right)\frac{\theta\,\theta_E}{(\theta^2+\theta_E^2)^2}\,f(\theta,\theta_i,\psi) \qquad (3.75)$$

$$f(\theta,\theta_i,\psi) = \left[\frac{1+(\theta_E/\theta)^2}{\cos^2\theta_i} - (\tan\theta_i\cos\psi + \theta_E/\theta)^2\right]^{1/2} \qquad (3.76)$$

where $\theta$ is the angle of scattering, $\theta_i$ is the angle between the incident electron and
an axis perpendicular to the surface, and $\psi$ is the angle between planes (perpendicular to the surface) that contain the incident and scattered electron wavevectors. In
Eq. (3.76), $\theta$ and $\theta_i$ can be positive or negative and $\theta_E$ is equal to $E_s/\gamma m_0 v^2$.

The angular distribution of scattering is shown in Fig. 3.22. For normal incidence
($\theta_i = 0$), $f(\theta, \theta_i, \psi) = 1$ and (unlike the case of volume plasmons) the scattered
intensity is zero in the forward direction ($\theta = 0$), a result of the fact that there is
then no component of momentum transfer along the boundary plane (momentum
perpendicular to the surface is absorbed by the lattice). The intensity rises rapidly to
a maximum at $\theta = \pm\theta_E/\sqrt{3}$, so the minimum around $\theta = 0$ is not easy to observe
with a typical TEM collection aperture and incident beam divergence.
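The position of the intensity maximum can be checked numerically. The sketch below evaluates the angle-dependent part of Eqs. (3.75) and (3.76), omitting the constant prefactor, and locates the peak by a simple grid search. The function names are illustrative, and using $|\theta|$ in the Lorentzian-like factor (so that the result stays positive for negative $\theta$) is an assumption made for this example.

```python
import math


def f_factor(theta, theta_i, psi, theta_e):
    """Geometric factor of Eq. (3.76); angles in radians, theta != 0."""
    x = theta_e / theta
    arg = (1.0 + x**2) / math.cos(theta_i)**2 \
        - (math.tan(theta_i) * math.cos(psi) + x)**2
    return math.sqrt(max(arg, 0.0))  # clamp tiny negative rounding errors


def angular_factor(theta, theta_i, psi, theta_e):
    """Angle-dependent part of Eq. (3.75), without the constant prefactor."""
    lorentz = abs(theta) * theta_e / (theta**2 + theta_e**2)**2
    return lorentz * f_factor(theta, theta_i, psi, theta_e)


def peak_angle(theta_i, theta_e, steps=20000, span=5.0):
    """Grid search for the positive angle of maximum intensity (psi = 0)."""
    grid = [span * theta_e * k / steps for k in range(1, steps + 1)]
    return max(grid, key=lambda th: angular_factor(th, theta_i, 0.0, theta_e))


if __name__ == "__main__":
    theta_e = 1.0e-4  # illustrative characteristic angle, rad
    print(peak_angle(0.0, theta_e) / theta_e, 1.0 / math.sqrt(3.0))
```

For $\psi = 0$ the factor in Eq. (3.76) simplifies to $|1 - (\theta_E/\theta)\tan\theta_i|$, which makes the asymmetry between positive and negative $\theta$ for a tilted specimen easy to see.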


Fig. 3.22 (a) Angular distribution of surface scattering; note the asymmetry and higher integrated
intensity of scattering in the case of a tilted ($\theta_i = 45^\circ$) specimen. (b) Vector diagram illustrating
the relationship between $q$ and $q_s$ for the case where $\theta$ and $\theta_i$ have the same sign and the scattering
angle is within the plane of incidence ($\psi = 0$)

For $|\theta| \gg \theta_E$, the intensity falls proportional to $\theta^{-3}$ rather than $\theta^{-2}$, as in the
case of bulk losses. This means that a small collection aperture, displaced off-axis
by a few $\theta_E$, can largely exclude surface contributions to the loss spectrum (Liu,
1988).

For nonnormal incidence, $f(\theta, \theta_i, \psi) \neq f(-\theta, \theta_i, \psi)$, leading to an asymmetrical angular distribution that has a higher maximum intensity as a result of the
$\cos^2\theta_i$ denominator in Eq. (3.76). This asymmetry has been verified experimentally
(Kunz, 1964; Schmüser, 1964). The zero in $dP_s/d\Omega$ again corresponds to the case
where the momentum transfer $\hbar q$ occurs in a direction perpendicular to the surface.

The total probability for surface plasmon excitation at a single vacuum interface
is obtained by integrating Eq. (3.75) over all $\theta$. For normal incidence ($\theta_i = 0$) the
result is (Stern and Ferrell, 1960)


$$P_s = \frac{\pi\hbar}{a_0 m_0 v(1+\varepsilon_1)} = \frac{e^2}{4\varepsilon_0\hbar v(1+\varepsilon_1)} \qquad (3.77)$$


At 100-keV incident energy, $P_s$ is 0.021 for $\varepsilon_1 = 1$ (vacuum/metal interface) and
0.011 for $\varepsilon_1 = 3$ (typical of many oxides). Taking into account both surfaces,
the probability of surface plasmon excitation in an oxidized aluminum sample is
therefore about 2% and the corresponding loss peak (at just over 7 eV) is clearly
visible only in rather thin samples, where the inelastic scattering due to bulk processes is weak. However, if the specimen is tilted away from normal incidence, $P_s$
is increased as a result of the $\cos\theta_i$ term in Eq. (3.76).
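The probabilities quoted for 100-keV electrons can be reproduced from the right-hand form of Eq. (3.77), which is conveniently written in terms of the fine-structure constant $\alpha = e^2/(4\pi\varepsilon_0\hbar c)$ as $P_s = \pi\alpha/[(v/c)(1+\varepsilon_1)]$. The sketch below is a minimal check; the function names are illustrative.

```python
import math

ALPHA = 7.2973525693e-3   # fine-structure constant e^2/(4 pi eps0 hbar c)
M0C2_KEV = 510.99895      # electron rest energy, keV


def beta(e0_kev):
    """v/c for an electron of kinetic energy e0_kev."""
    gamma = 1.0 + e0_kev / M0C2_KEV
    return math.sqrt(1.0 - 1.0 / gamma**2)


def surface_plasmon_probability(e0_kev, eps1=1.0):
    """P_s = e^2 / [4 eps0 hbar v (1 + eps1)] = pi * alpha / [beta (1 + eps1)]."""
    return math.pi * ALPHA / (beta(e0_kev) * (1.0 + eps1))


if __name__ == "__main__":
    for eps1 in (1.0, 3.0):
        p = surface_plasmon_probability(100.0, eps1)
        print(f"eps1 = {eps1:.0f}: P_s = {p:.4f}")
```

Because $P_s$ varies as $1/v$, the surface contribution becomes relatively more important at lower incident energies.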


3.3.5.2 Dielectric Formulation for Surface Losses 


The free-electron approximation $\varepsilon_b = 1 - \omega_p^2/\omega^2$ can be avoided by characterizing
the materials (conductors or insulators) on both sides of the boundary by frequency-dependent permittivities $\varepsilon_a$ and $\varepsilon_b$. Dielectric theory then provides an expression
for the differential “probability” of surface scattering at a single interface (Raether,
1980):


$$\frac{d^2P_s}{d\Omega\,dE} = \frac{k_0^2|q_s|}{\pi^2 a_0 m_0 v^2 q^4\cos\theta_i}\,\mathrm{Im}\left[\frac{(\varepsilon_a-\varepsilon_b)^2}{\varepsilon_a\varepsilon_b(\varepsilon_a+\varepsilon_b)}\right] = \frac{k_0^2|q_s|}{\pi^2 a_0 m_0 v^2 q^4\cos\theta_i}\,\mathrm{Im}\left[\frac{-4}{\varepsilon_a+\varepsilon_b}+\frac{1}{\varepsilon_a}+\frac{1}{\varepsilon_b}\right] \qquad (3.78)$$


assuming that the electron remains in the plane of incidence ($\psi = 0$). The last two
terms in Eq. (3.78) represent the begrenzungs effect, a reduction in the bulk plasmon
intensity (see later). In the small-angle approximation, $q^2 = k_0^2(\theta^2 + \theta_E^2)$ and (see
Fig. 3.22)


$$q_s = k_0\theta\cos\theta_i + k_0\theta_E\sin\theta_i \qquad (3.79)$$


where $q_s$ is the wavevector of the surface plasmon, equal to the component of the
scattering vector that lies parallel to the surface. One of the terms in Eq. (3.79)
is negative if $\theta$ and $\theta_i$ are of opposite sign. Note that Eqs. (3.78) and (3.71) are
symmetric in $\varepsilon_a$ and $\varepsilon_b$, so the direction of travel of the incident electron is unimportant. The width of the plasmon peak is determined by the imaginary parts of the
permittivities on both sides of the boundary.

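For the case of perpendicular incidence ($\theta_i = 0$), Eq. (3.78) can be integrated up
to a scattering angle $\beta$ to give


$$\frac{dP_s}{dE} = \frac{1}{2\pi a_0 k_0 T}\left[\frac{\tan^{-1}(\beta/\theta_E)}{\theta_E} - \frac{\beta}{\beta^2+\theta_E^2}\right]\mathrm{Im}\left[\frac{-4}{\varepsilon_a+\varepsilon_b}+\frac{1}{\varepsilon_a}+\frac{1}{\varepsilon_b}\right] \qquad (3.80)$$


for a single interface, where $T = m_0 v^2/2$.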
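The angular term in this result arises from integrating the small-angle form of the surface-loss intensity over solid angle. With $q_s = k_0\theta$ and $q^2 = k_0^2(\theta^2+\theta_E^2)$, the element $2\pi\theta\,d\theta$ leaves an integrand proportional to $\theta^2/(\theta^2+\theta_E^2)^2$. The sketch below (illustrative function names, simple midpoint rule) compares a numerical integration with the closed-form bracket.

```python
import math


def angular_integral_closed(beta_c, theta_e):
    """Closed form: atan(beta/theta_E)/theta_E - beta/(beta^2 + theta_E^2)."""
    return math.atan(beta_c / theta_e) / theta_e \
        - beta_c / (beta_c**2 + theta_e**2)


def angular_integral_numeric(beta_c, theta_e, steps=200000):
    """Midpoint-rule value of 2 * int_0^beta theta^2/(theta^2+theta_E^2)^2 dtheta."""
    h = beta_c / steps
    total = 0.0
    for k in range(steps):
        th = (k + 0.5) * h
        total += th**2 / (th**2 + theta_e**2)**2
    return 2.0 * total * h


if __name__ == "__main__":
    beta_c, theta_e = 10.0e-3, 1.0e-4  # illustrative angles in radians
    print(angular_integral_closed(beta_c, theta_e))
    print(angular_integral_numeric(beta_c, theta_e))
```

For $\beta \gg \theta_E$ the bracket tends to $\pi/(2\theta_E)$, so the collected surface-loss intensity saturates once the aperture exceeds a few $\theta_E$.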


3.3.5.3 Very Thin Specimens 


The surface plasmons excited on each surface of a specimen of thickness $t$ are almost
independent of each other if


$$q_s t \approx k_0\theta t \gg 1 \qquad (3.81)$$


For 100-keV incident electrons ($k_0 = 1700$ nm$^{-1}$), $\theta_i = 0$ and $\theta \approx \theta_E/\sqrt{3} \approx 0.1$ mrad (the most probable angle of surface scattering; see Fig. 3.22), Eq. (3.81)
implies $t \gg 10$ nm. If this condition is not fulfilled, the electrostatic fields originating from the two surfaces overlap and the surface plasmons interact with each other;
see Fig. 3.23. In the case of a free-electron metal bounded by similar dielectrics
($\varepsilon_a = \varepsilon_c = \varepsilon$ and $\varepsilon_b = 1 - \omega_p^2/\omega^2$) the resonance is split into two modes, the frequency of each being $q_s$-dependent and given approximately, for large $q_s$, by (Ritchie,
1957)


$$\omega_\pm = \omega_p\left[\frac{1 \pm \exp(-q_s t)}{1+\varepsilon}\right]^{1/2} \qquad (3.82)$$

The symmetric mode, where like charges face one another (Fig. 3.23b), corresponds
to the higher angular frequency. For small $q_s$, Eq. (3.82) does not apply; relativistic
constraints (Kröger, 1968) cause $\omega$ to lie below¹ the photon line ($\omega = cq_s$) on
the dispersion diagram, as shown in Fig. 3.24. This dispersion behavior has been
verified experimentally (Pettit et al., 1975).
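The splitting described by Eq. (3.82) is easy to tabulate. The following sketch (illustrative function name; the nonretarded formula is applied as written, so it is only meaningful where Eq. (3.82) itself holds) returns the two mode energies for a given value of the product $q_s t$.

```python
import math


def coupled_mode_energies(e_p, qs_t, eps=1.0):
    """Return (E_plus, E_minus) from Eq. (3.82) for a given product q_s * t."""
    if qs_t < 0.0:
        raise ValueError("q_s * t must be non-negative")
    overlap = math.exp(-qs_t)  # coupling between the two surfaces
    e_plus = e_p * math.sqrt((1.0 + overlap) / (1.0 + eps))
    e_minus = e_p * math.sqrt((1.0 - overlap) / (1.0 + eps))
    return e_plus, e_minus


if __name__ == "__main__":
    for qs_t in (0.1, 0.5, 1.0, 3.0, 10.0):
        e_plus, e_minus = coupled_mode_energies(15.0, qs_t)
        print(f"q_s t = {qs_t:4.1f}: E+ = {e_plus:5.2f} eV, E- = {e_minus:5.2f} eV")
```

As $q_s t$ increases, both energies converge on the single-surface value $E_p/(1+\varepsilon)^{1/2}$, in line with the condition of Eq. (3.81) for independent surfaces.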

Assuming normal incidence and neglecting retardation effects, the differential
probability for surface excitation at both surfaces of a film of permittivity $\varepsilon_b$ and
thickness $t$ can be expressed (Raether, 1967) as


¹ There are, in fact, radiative surface plasmons that lie above this line, but they are less easily
observed in the energy-loss spectrum because for small scattering angles their energy is the same
as that of the volume plasmons (see Fig. 3.24).

Fig. 3.23 Electric field lines associated with surface

Fig. 3.24 Dispersion diagram for surface plasmons. The dashed lines represent Eq. (3.82); the
solid curves were calculated taking into account retardation, for film thicknesses given by $q_p t$ =
0.01, 0.63, and $\infty$, where $q_p = \omega_p/c$. For $\theta_i = 0$, the horizontal axis is approximately proportional
to scattering angle $\theta$ of the fast electron, since $q_s/q_p \approx (v/c)^{-1}(\theta/\theta_E)$. The dispersion relation of
a radiative plasmon is shown schematically by the dotted curve. From Raether (1980), copyright
Springer-Verlag

where $T = m_0 v^2/2$ and


$$R_c = \frac{\varepsilon_a\sin^2(tE/2\hbar v)}{\varepsilon_b + \varepsilon_a\tanh(q_s t/2)} + \frac{\varepsilon_a\cos^2(tE/2\hbar v)}{\varepsilon_b + \varepsilon_a\coth(q_s t/2)} \qquad (3.83a)$$


For large film thickness, $R_c$ becomes equal to $\varepsilon_a/(\varepsilon_a + \varepsilon_b)$, where $\varepsilon_a$ is the permittivity of the surroundings ($\varepsilon_a = 1$ for vacuum). A relativistic version of Eq. (3.83),
given by Kröger (1968), shows that relativistic effects modify the dispersion relation
of the surface plasmon (particularly at low $q_s$) and result in slightly lower resonance
energies, in closer agreement with experiment.

Integrating Eq. (3.83) over energy loss, Ritchie (1957) investigated how the total
probability $P_s$ of surface plasmon loss (for a free-electron metal) varies with sample
thickness. For $t \gg v/\omega_p$ (= 7.2 nm for $E_0$ = 100 keV, $E_p$ = 15 eV), $P_s$ becomes
independent of thickness and tends asymptotically to $e^2/(4\varepsilon_0\hbar v)$, twice the value
given by Eq. (3.77) since the specimen has two surfaces. For $t < 5(v/\omega_p)$, the
surface-loss probability increases slightly (see Fig. 3.25), corresponding to increase
in the asymmetric ($\omega_-$) surface mode. But as shown by Eq. (3.82), the energy loss
associated with this mode tends to zero as $t \to 0$, so the total energy loss due to
surface plasmon excitation falls toward zero for very thin films, as indicated by the
dashed curve in Fig. 3.25.
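The characteristic length $v/\omega_p$ quoted above can be verified with a few lines of arithmetic. In this sketch the function names are illustrative; the only physics used is the relativistic electron speed and $\omega_p = E_p/\hbar$.

```python
import math

C = 2.99792458e8             # speed of light, m/s
HBAR_EV_S = 6.582119569e-16  # reduced Planck constant, eV s
M0C2_KEV = 510.99895         # electron rest energy, keV


def electron_speed(e0_kev):
    """Relativistic speed (m/s) of an electron with kinetic energy e0_kev."""
    gamma = 1.0 + e0_kev / M0C2_KEV
    return C * math.sqrt(1.0 - 1.0 / gamma**2)


def surface_coupling_length_nm(e0_kev, e_p_ev):
    """Characteristic length v/omega_p in nm, with omega_p = E_p/hbar."""
    omega_p = e_p_ev / HBAR_EV_S
    return electron_speed(e0_kev) / omega_p * 1e9


if __name__ == "__main__":
    print(f"v/omega_p = {surface_coupling_length_nm(100.0, 15.0):.2f} nm")
```

The same function gives the approximate depth over which the begrenzungs effect operates, since that depth is also of the order of $v/\omega_p$.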


3.3.5.4 Begrenzungs Effect 


A secondary effect of surface plasmon excitation is to reduce the intensity of the
bulk plasmon peak, the begrenzungs (boundary) effect. Within a distance of the
order of $v/\omega_p$ of each surface, the transmitted electron excites surface rather than
bulk plasmons. This effect gives rise to a negative value of the energy-loss function
at $E \approx \hbar\omega_p$, represented by the $-\mathrm{Im}(-1/\varepsilon_a)$ and $-\mathrm{Im}(-1/\varepsilon_b)$ terms in Eqs. (3.78)
and (3.80).

For $t > v/\omega_p$, the calculated reduction $\Delta P_v$ in the probability $P_v$ of volume
plasmon excitation is just equal to the probability of surface plasmon generation at
a single surface (see Fig. 3.25), which is typically about 1% and small compared
to $P_v$. However, this negative surface contribution occurs mainly at small angles
($\theta \approx \theta_E$), due to the relatively narrow angular distribution of the surface-loss intensity. Its effect is therefore greater if the energy-loss spectrum is recorded with a
small angular-collection aperture. For an effective aperture of 0.2 mrad and 50-keV
electrons transmitted through a film of aluminum, Raether (1967) reported a reduction in the volume-loss intensity of 8% at $t$ = 100 nm, increasing to over 40% at
$t$ = 10 nm.

Fig. 3.25 Thickness dependence of the probability $P_s$ of surface plasmon excitation and the associated energy loss (in units of $E_p/2$, dashed curve) for normally incident electrons of speed $v$,
calculated using a free-electron model for a specimen with two clean and parallel surfaces. From
Ritchie (1957), copyright American Physical Society. The lower curve shows the reduction $\Delta P_v$
in the probability of volume plasmon excitation, the begrenzungs effect. This effect has also been
calculated for the case of a small sphere (Echenique et al., 1987). Available at http://link.aps.org/abstract/PR/v106/p874


3.3.5.5 Retardation Effects


By solving Maxwell’s equations with appropriate boundary conditions, Kröger
(1968) derived an expression for the differential probability of energy loss in a specimen of thickness $t$, including volume and surface losses, transition radiation, and
retardation effects. Using notation similar to that of Erni and Browning (2008), this
differential probability is


$$\frac{d^2P}{d\Omega\,dE} = \frac{1}{\pi^2 a_0 m_0 v^2}\,\mathrm{Im}\left[\frac{t\mu^2}{\varepsilon^*\phi^2} - \frac{2\theta^2(\varepsilon^*-\eta^*)^2}{k_0\phi_0^4\phi^4}\,(A+B+C)\right] \qquad (3.84)$$


Here $\varepsilon^* = \varepsilon_1 - i\varepsilon_2$ is the complex conjugate of the dielectric function of the
specimen and $\eta^*$ is an equivalent quantity for the surroundings ($\eta$ = relative permittivity of the oxide, for a loss-free oxide coating on each surface). The angular
terms $\phi^2 = \lambda^2 + \theta_E^2$ and $\phi_0^2 = \lambda_0^2 + \theta_E^2$, where $\lambda^2 = \theta^2 - \varepsilon^*\theta_E^2(v/c)^2$ and
$\lambda_0^2 = \theta^2 - \eta^*\theta_E^2(v/c)^2$, are dimensionless, as is $\mu^2 = 1 - \varepsilon^*(v/c)^2$.

The first term in square brackets represents the Cerenkov-enhanced volume loss,
equivalent to Eq. (3.70), the other terms representing surface plasmon excitation and
the effect of the surfaces on Cerenkov emission. The surface plasmon term is

where $d_e = \pi tE/(hv)$, $L^+ = \lambda_0\varepsilon^* + \lambda\eta^*\tanh(\lambda d_e/\theta_E)$, and $L^- = \lambda_0\varepsilon^* + \lambda\eta^*\coth(\lambda d_e/\theta_E)$, as in Eq. (3.83a). The remaining terms account for guided-light
modes in the specimen:

$$B = \frac{v^2}{c^2}\,\lambda_0\theta_E\phi_{01}^2\left(\frac{1}{L^+} - \frac{1}{L^-}\right)\sin(2d_e) \qquad (3.84b)$$

where $\phi_{01}^2 = \theta^2 + \theta_E^2[1 - (\varepsilon^* - \eta^*)(v^2/c^2)]$.

Equation (3.84) can be used to predict the angular and thickness dependence of
the low-loss spectrum, as illustrated in Fig. 3.26. In silicon, the retardation effects
are seen to occur at even smaller angles and energy loss than the surface plasmon
loss, and their energy dependence is sensitive to specimen thickness below about
1000 nm. For very thin specimens ($t$ < 5 nm), the surface and bulk plasmon peaks
largely disappear, leaving retardation and interband transitions as the main spectral
features below 10 eV. This implies that direct bandgap measurements are most reliably made on isolated ultrathin structures, such as suspended nanotubes (Yurtsever
et al., 2008).

Fig. 3.26 (a) Schematic and contour plot (gray background) showing the calculated energy loss
and angular dependence of intensity for a 50-nm Si specimen and 300-keV incident electrons.
(b) Energy-loss spectra calculated from Eq. (3.84) for 2.1-mrad collection semi-angle, 300-keV
incident electrons, and various thicknesses of silicon, assuming no surface-oxide layer. The intensities are normalized and the spectra displaced vertically for clarity. Reproduced from Erni and
Browning (2008), copyright Elsevier. A computer program for evaluating Eq. (3.84) is described
in Appendix B9

For silicon specimens of thickness between 25 and 310 nm, the E-dependence of 
scattering probability (for E < 5 eV) calculated from Eq. (3.84) shows good agree- 
ment with experiment (Yurtsever et al., 2008). Equation (3.84) can be generalized to 
the case of nonnormal incidence (Kröger, 1970; Mkhoyan et al., 2007). Anisotropic
materials introduce further complications (Chen and Silcox, 1979). Cerenkov exci- 
tation by aloof electrons and adjacent to curved surfaces is analyzed by Garcia de 
Abajo et al. (2004). 


3.3.6 Surface-Reflection Spectra 


Instead of measuring transmitted electrons, energy-loss spectra can be recorded 
from electrons that have been reflected from the surface of a specimen. The depth of 
penetration of the electrons (perpendicular to the surface) depends on the primary 
beam energy and the angle of incidence $\theta_i$ (see Fig. 3.27). For moderate angles,
both bulk- and surface-loss peaks occur in the reflection spectrum; at a glancing
angle ($\theta_i > 80^\circ$) the penetration depth is small and only surface peaks are observed,
particularly if the spectrum has been recorded from specularly reflected electrons 
(angle of reflection = angle of incidence) and the incident energy is not too high 
(Powell, 1968). 


Fig. 3.27 (a) Plasmon-loss spectra recorded by reflection of 8-keV electrons from the unoxidized
surface of liquid aluminum. At glancing incidence, the spectrum is dominated by plural scattering
from surface plasmons; as the angle of incidence is reduced, volume plasmon peaks appear at
multiples of 15 eV. (b) Reflection of an electron at the surface of a crystal, showing the difference
between the angles $\theta_s$ and $\theta_B$ due to refraction. From Powell (1968), copyright American Physical
Society. Available at http://link.aps.org/abstract/PR/v175/p972

In the case of a crystalline specimen, the reflected intensity is strong when the
angle between the incident beam and the surface is equal to a Bragg angle for atomic 
planes which lie parallel to the surface, the specular Bragg condition. The intensity 
is further increased by adjusting the crystal orientation so that a resonance parabola 
seen in the reflection diffraction pattern (the equivalent of a Kikuchi line in trans- 
mission diffraction) intersects a Bragg-reflection spot, such as the (440) reflection 
for a {110} GaAs surface, giving a surface-resonance condition. The penetration 
depth of the electrons is then only a few monolayers, but the electron wave travels a 
short distance (typically of the order of 100 nm) parallel to the surface before being 
reflected (Wang and Egerton, 1988). 

The ratio $P_s$ of the integrated surface plasmon intensity, relative to the zero-loss
intensity in a specular Bragg-reflected beam, has been calculated on the basis of
both classical and quantum-mechanical theory (Lucas and Sunjic, 1971; Evans and
Mills, 1972). For a clean surface ($\varepsilon_a = 1$) and assuming negligible penetration of
the electrons,


$$P_s = e^2/(8\varepsilon_0\hbar v\cos\theta_i) \qquad (3.85)$$


which is identical to the formula for perpendicular transmission through a single
interface, Eq. (3.77), except that the incident velocity $v$ is replaced by its component $v\cos\theta_i$ normal to the surface. In the case of measurements made at glancing
incidence, this normal component is small and $P_s$ may approach or exceed unity.²
Surface peaks then dominate the energy-loss spectrum (see Fig. 3.27); bulk plasmons are observed only for lower values of $\theta_i$ or (owing to the broader angular
distribution of volume scattering) when recording the spectrum at inelastic scattering angles $\theta$ away from a specular beam (Schilling, 1976; Powell, 1968). An
equivalent explanation for the increase in surface loss as $\theta_i \to 90^\circ$ is that the
incident electron spends a longer time in the vicinity of the surface (Raether, 1980).

Schilling and Raether (1973) have reported energy gains of $\hbar\omega_s$ in energy-loss
spectra of 10-keV electrons reflected from a liquid-indium surface at $\theta_i \approx 88.5^\circ$.
Such processes were measurable only with an incident beam current so high that the 
time interval between the arrival of the electrons was comparable with the surface 
plasmon relaxation time. Even under the somewhat-optimized conditions used in 
this experiment, the probability of energy gain was only about 0.2%. 

The electron microscope allows a reflection diffraction pattern to be observed
and indexed, so the value of the incident angle $\theta_i$ can be obtained from the Bragg
angle $\theta_B$ of each reflected beam, provided allowance is made for refraction of the
electron close to the surface (see Fig. 3.27b). The refraction effect depends on the
mean inner potential $\phi_0$ of the specimen; using relativistic mechanics to calculate
the acceleration of the incident electron toward the surface gives


$$\cos^2\theta_i = \sin^2\theta_s = \sin^2\theta_B - \frac{2(e\phi_0)(1 - v^2/c^2)^{1/2}}{m_0c^2(v^2/c^2)} \qquad (3.86)$$

Since $\cos\theta_i = \sin\theta_s \approx \theta_s$, where $\theta_s$ is the angle between the incident beam
and the surface (measured outside the specimen), Eq. (3.85) leads to the relation $P_s\theta_s$ = constant, which has been verified experimentally (Powell, 1968;
Schilling, 1976; Krivanek et al., 1983). For a given Bragg reflection, the value of
$P_s$ calculated from Eqs. (3.85) and (3.86) tends to increase with increasing incident energy; for the symmetric (333) reflection from silicon, for example, $P_s$ is
1.06 at 20 keV and 1.36 at 80 keV. Experimental values (Krivanek et al., 1983)
are somewhat higher (1.4 and 1.8), probably because Eq. (3.85) neglects surface
plasmon excitation during the brief period when the electron penetrates inside the
crystal.


² Strictly speaking, $P_s$ is a “scattering parameter” analogous to $t/\lambda$ in Section 3.3.6, but approximates a single-scattering probability if much less than unity.

Reflection energy-loss spectroscopy is a more surface-sensitive technique when
the incident energy is low, partly because the electron penetration depth is smaller
(the extinction distance for elastic scattering is proportional to electron velocity
$v$) and partly because the electrostatic field of a surface plasmon extends into the solid
to a depth of approximately $1/q_s \approx (k_0\theta_E)^{-1} = v/\omega_p$ (for $\theta_i \approx \pi/2$), which is
proportional to the incident velocity.


3.3.6.1 Nonpenetrating Incident Beam (Aloof Excitation) 


It is possible to measure energy losses of primary electrons that pass close to a 
surface but remain outside, such as when a finely focused electron beam is directed 
parallel to a face of a cubic MgO crystal (Marks, 1982). In the case of a metal, bulk 
plasmons are not excited and surface excitations can be studied alone. Classical, 
nonrelativistic theory gives for the excitation probability (Howie, 1983) 


$$\frac{dP_s(x, E)}{dE} = \frac{2z}{\pi a_0 m_0 v^2}\,K_0(2\omega x/v)\,\mathrm{Im}\left[\frac{\varepsilon(E) - 1}{\varepsilon(E) + 1}\right] \qquad (3.87)$$


where $z$ is the length of the electron path parallel to the surface (a distance $x$ away),
$K_0(2\omega x/v)$ is a modified Bessel function and $\varepsilon(E)$ is the complex permittivity of
the specimen, which is a function of the energy loss $E = \hbar\omega$. Equation (3.87) can
be generalized to deal with the case where the specimen surface is curved (Batson, 
1982; Wheatley et al., 1984) or where the incident electron executes a parabolic 
trajectory as a result of a negative potential applied to the sample (Ballu et al., 1976). 
A range of impact parameter x can be selected by scanning the incident beam and 
using a gating circuit to switch the spectrometer signal on or off in response to 
the output of a dark-field detector (Wheatley et al., 1984). Bertsch et al. (1998) 
used dielectric theory to derive a formula for the energy-loss spectrum of electrons 
passing at different distances from a cylindrical nanowire. 


3.3.6.2 Cross-Sectional Specimens


Cross-sectional TEM specimens, of electronic devices for example, often contain 
interfaces that are perpendicular to the plane of the specimen and therefore parallel 
to the incident electron beam or can be made so by slightly tilting the speci- 
men. Bolton and Chen (1995a, b, c) applied dielectric theory (including retardation 
effects) to electrons traversing multilayers at both parallel and perpendicular inci- 
dence, and also moving within anisotropic media. Discrepancies with experimental 
results are discussed by Neyer et al. (1997) in terms of coupled interface modes and 
the begrenzungs effect. Moreau et al. (1997) measured energy-loss spectra for electrons traveling within 10 nm of a Si-SiO2 boundary and showed that a relativistic
analysis (Garcia-Molina et al., 1985), including retardation effects, was necessary
to account for shift of a 7-eV peak with distance of the electron trajectory from the 
interface. 

Theories of interface excitation have usually assumed that the interface is abrupt. 
The case of diffuse interface was investigated by Howie et al. (2008), who showed 
that for electrons traveling at various distances parallel to an interface, the plas- 
mon peaks differ in position, width, and shape from those expected for an abrupt 
interface and from the volume plasmon peaks that would reflect the local chemical 
composition. 


3.3.7 Plasmon Modes in Small Particles 


In the case of small particles, the surface-plasmon traveling waves become standing 
waves that can be characterized by the number of nodes, somewhat like the modes 
of vibration of a mechanical system. For an isolated spherical particle (relative
permittivity $\varepsilon_b$) surrounded by a medium of permittivity $\varepsilon_a$, the
surface-resonance condition is modified from Eq. (3.71) to become


$$\varepsilon_a + [l/(l+1)]\,\varepsilon_b = 0 \tag{3.88}$$


where $l$ is an integer. For a free-electron metal, Eq. (3.88) gives for the
surface-resonance frequency:


$$\omega_s = \omega_p\,[1 + \varepsilon_a (l+1)/l]^{-1/2} \tag{3.89}$$


The lowest frequency of these localized plasmonic modes corresponds to $l = 1$
(dipole mode) and predominates in very small spheres (radius r < 10 nm). As the
radius increases, the energy-loss intensity shifts to higher order modes and the
resonance frequency increases asymptotically toward the value given by Eq. (3.73) for a
flat surface. This behavior was verified by experiments on metal spheres (Fujimoto 
and Komaki, 1968; Achéche et al., 1986), colloidal silver and gold particles embed- 
ded in gelatin (Kreibig and Zacharias, 1970), and irradiation-induced precipitates of 
sodium and potassium in alkali halides (Creuzburg, 1966). 
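
The approach of the multipole frequencies toward the flat-surface value can be made concrete with a short numerical sketch. It assumes the free-electron form $\omega_s = \omega_p[1 + \varepsilon(l+1)/l]^{-1/2}$, with $\varepsilon$ taken as a real, frequency-independent permittivity of the surrounding medium; the function names are illustrative only.

```python
import math


def sphere_mode_ratio(l, eps_medium=1.0):
    """Return omega_s / omega_p for multipole order l (l = 1 is the dipole mode)."""
    if l < 1:
        raise ValueError("l must be a positive integer")
    return 1.0 / math.sqrt(1.0 + eps_medium * (l + 1) / l)


def flat_surface_ratio(eps_medium=1.0):
    """Large-l limit of the ratio above: 1 / sqrt(1 + eps_medium)."""
    return 1.0 / math.sqrt(1.0 + eps_medium)


if __name__ == "__main__":
    for order in (1, 2, 5, 20, 100):
        print(order, round(sphere_mode_ratio(order), 4))
    print("flat surface:", round(flat_surface_ratio(), 4))
```

For a sphere in vacuum the dipole mode lies at $\omega_p/\sqrt{3}$, and the ratio rises monotonically with $l$ toward $\omega_p/\sqrt{2}$.
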

The probability of exciting a given mode, averaged over all possible trajectories
of the fast electron, is of the form (Fujimoto and Komaki, 1968) 


[o,@) 
dP) 8hr = (ws\" wwe (21+ 13 | Viol (3.90) 
dw ~~ agmov?q* \ap (@? — w2) + wf? l a eee 
. or/v 
where $\Gamma = 1/\tau$ is the damping constant of the metal and $J_l$ is a spherical
Bessel function. Stöckli et al. (1997) measured plasmon-loss spectra of aluminum
nanospheres and carbon nanotubes of various diameters, interpreting the results in
terms of dielectric theory; see also Garcia de Abajo and Howie (2002).

Dielectric theory has also been used to predict the additional peaks that occur 
when spherical metal particles are attached to a substrate (Wang and Cowley, 
1987; Ouyang and Isaacson, 1989; Zabala and Rivacoba, 1991). For the case of 
small spheres embedded in a medium, Howie and Walsh (1991) have proposed
an effective energy-loss function $\mathrm{Im}(-1/\varepsilon)_{\rm eff}$ that is geometrically
averaged over different segments of a typical electron trajectory. They show that this
function predicts the observed spectrum of irradiated AlF3 (containing small Al particles)
more successfully than effective medium theories, which give formulas for an
effective permittivity $\varepsilon_{\rm eff}$. Measurement of $\mathrm{Im}(-1/\varepsilon)_{\rm eff}$ might
yield the average size and volume fraction of fine precipitates or point-defect clusters
in specimens whose structure is too fine in scale or too complex to permit direct imaging.

Cavities in a metal or dielectric also have characteristic resonance frequencies, 
given for the case of a spherical void ($\varepsilon = 1$) in a metal by


$$\omega_s = \omega_p\,[(m+1)/(2m+1)]^{1/2} \tag{3.91}$$


The frequency decreases toward $\omega_p/\sqrt{2}$ as the integer $m$ increases from zero
(Raether, 1980). As an example, helium-filled “voids” in Al/Li alloy ($\hbar\omega_p = 15$ eV)
appear bright in the image formed from 11-eV loss electrons (Henoc and Henry, 
1970). 
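
As a rough numerical illustration of the void resonances, the sketch below evaluates $\omega_s = \omega_p[(m+1)/(2m+1)]^{1/2}$ for a given bulk plasmon energy. The function name is an illustrative choice, and damping and the permittivity of any gas inside the void are neglected.

```python
import math


def void_mode_energy(m, plasmon_energy_eV):
    """Energy (eV) of the m-th resonance of a spherical void in a free-electron metal."""
    if m < 0:
        raise ValueError("m must be a non-negative integer")
    return plasmon_energy_eV * math.sqrt((m + 1) / (2 * m + 1))


if __name__ == "__main__":
    for order in range(0, 6):
        print(order, round(void_mode_energy(order, 15.0), 2))
```

With a 15-eV bulk plasmon, the modes with $m \geq 1$ fall between about 10.6 and 12.2 eV, a range that includes the 11-eV loss used for the image mentioned above.
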

If a spherical particle becomes oblate, its surface plasmon resonance splits into 
two modes, longitudinal and transverse to the long axis, the transverse mode having 
a somewhat higher energy. A similar situation exists for a dimer consisting of two 
spherical particles or a chain of such particles: as the interparticle spacing decreases, 
the longitudinal-mode redshift increases (Wang et al., 2010a). 

When two nanoparticles come into close contact, their plasmon modes inter- 
act electromagnetically, giving rise to hybridized plasmonic states that correspond 
to the near-field optical region, with an enhancement of electric field at the parti- 
cle surface. The strongest coupling occurs for longitudinal polarization, where the 
symmetric bonding states have nonzero dipole moment and are referred to as bright 
modes, since they are readily excited by incident photons. The antibonding states, 
dark modes, possess higher energy and an antisymmetric electric field with zero 
dipole moment. Quadrupole and higher order modes of single particles are also 
dark modes, as are propagating modes in nanoparticle chains. They interact only 
weakly with incident light but are of practical importance because of the possibility
of lossless waveguide transmission on a scale below the optical wavelength, useful 
in surface-enhanced Raman spectroscopy and in biosensing and lasing applications. 

A quantitative treatment is provided by the plasmon hybridization model, in 
which the plasmon modes are treated as bonding and antibonding states of a one- 
particle plasmon (Nordlander and Oubre, 2004). This model was used by Koh et al. 
(2009) to describe spectra for a sub-nanometer probe traveling within and outside 
spherical silver particles and pairs of particles (dimers). The modes can be displayed 
in energy-filtered images; in the limit of zero separation, they may differ by almost 
a factor of 2 in energy, corresponding to the multipole and monopole modes of a 
nanowire (Wang et al., 2009b). 

Plasmon-resonance modes of small particles and arrays of particles are con- 
veniently studied by STEM-EELS, as originally demonstrated by Batson (1982). 
Recent improvements in instrumentation have made possible images with sub- 
nanometer spatial resolution and sub-electron volt energy resolution. This capability 
has led to impressive energy-filtered images that display the geometry of the surface 
plasmon modes within individual nanoparticles, and in arrays of particles or holes, 
as discussed in Section 5.2.3. 


3.4 Single, Plural, and Multiple Scattering 


For a very thin specimen, the probability that a transmitted electron undergoes more 
than one scattering event is low. Neglecting energy broadening by the spectrometer
system, the intensity $J^1(E)$ in the energy-loss spectrum then approximates to a
single-scattering distribution (SSD) or single-scattering profile $S(E)$:


$$J^1(E) \approx S(E) = I_0 n_a t\,(d\sigma/dE) + I_0\,(dP_s/dE) \tag{3.92}$$


where $I_0$ is the zero-loss intensity, approximately equal (because of the low scattering
probability) to the total area $I_t$ under the loss spectrum. Here, $n_a$ is the number of
atoms (or molecules) per unit volume of the specimen and $t$ is the specimen thickness
within the irradiated area. The energy-differential cross section per atom (or
molecule) $d\sigma/dE$ is obtained by integrating the double-differential cross section,
given by Eq. (3.29) or (3.32), up to a scattering angle equal to the collection
semi-angle $\beta$ used when acquiring the spectrum. The last term in Eq. (3.92) represents
the intensity arising from surface-mode scattering. Integration over energy loss $E$ gives
the total single-scattering intensity:


$$I_1(\beta) = I_0 n_a t\,\sigma(\beta) + I_0 P_s(\beta) = I_0\,[t/\lambda(\beta) + P_s(\beta)] \tag{3.93}$$


where $\sigma(\beta)$ is an integral cross section, given by Eq. (3.57) for the case of volume
plasmon excitation.


3.4.1 Poisson’s Law


If inelastic scattering is viewed in terms of collisions that are independent events, 
their occurrence should obey Poisson statistics: the probability that a transmitted 
electron suffers $n$ collisions is $P_n = (1/n!)\,m^n \exp(-m)$, where $m$ is the mean number
of collisions incurred by an electron traveling through the specimen. Therefore,
we set $m = t/\lambda$, where $\lambda$ is the average distance between collisions, the mean free
path for inelastic scattering. Sometimes $t/\lambda$ is referred to as the scattering parameter
of the specimen. $P_n$ is represented in the energy-loss spectrum by the ratio of
the energy-integrated intensity $I_n$ of $n$-fold scattering to the total integrated
intensity $I_t$:


$$P_n = I_n/I_t = (1/n!)\,(t/\lambda)^n \exp(-t/\lambda) \tag{3.94}$$


The variation of $P_n$ with scattering parameter is shown in Fig. 3.28. For a given order
$n$ of scattering, the intensity is highest when $t/\lambda = n$. In the case of the unscattered
($n = 0$) component (zero-loss peak), the intensity is therefore a maximum at $t = 0$
and decreases exponentially with specimen thickness. For $n = 0$, Eq. (3.94) gives


$$t/\lambda = \ln(I_t/I_0) \tag{3.95}$$


Equation (3.95) provides a simple way of measuring the thickness of a TEM
specimen from the energy-loss spectrum; see Section 5.1.
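
The Poisson fractions and the log-ratio estimate of the scattering parameter can be sketched in a few lines. This is a minimal illustration under the assumptions stated above (independent events, uniform thickness, all scattering collected); the function names are not from the text.

```python
import math


def poisson_fraction(n, t_over_lambda):
    """P_n = I_n / I_t for n-fold scattering at scattering parameter t/lambda."""
    return t_over_lambda ** n * math.exp(-t_over_lambda) / math.factorial(n)


def scattering_parameter(total_intensity, zero_loss_intensity):
    """Log-ratio estimate t/lambda = ln(I_t / I_0)."""
    return math.log(total_intensity / zero_loss_intensity)


if __name__ == "__main__":
    for order in range(4):
        print(order, [round(poisson_fraction(order, s), 3) for s in (0.5, 1.0, 2.0)])
```

Feeding the zero-loss fraction back into the log-ratio recovers the scattering parameter, and each $P_n$ peaks where $t/\lambda = n$.
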


From Eq. (3.94), the average energy loss $\langle E \rangle$ per incident electron is


$$\langle E \rangle = \sum_{n=0}^{\infty} P_n E_n = \sum_{n=0}^{\infty} (nE_p)\,(1/n!)\,(t/\lambda)^n \exp(-t/\lambda) \tag{3.95a}$$

where the low-loss spectrum is approximated as a series of sharp peaks at multiples
of $E_p$. In that case,


$$\langle E \rangle = E_p \exp(-t/\lambda) \sum_{n=1}^{\infty} \frac{(t/\lambda)^n}{(n-1)!} = (t/\lambda)\,E_p \tag{3.95b}$$


Therefore there is a simple relationship between $\langle E \rangle$, the average loss per incident
electron, and $E_p$, which is the average energy loss per inelastic scattering event.
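
The relationship between the mean loss per incident electron and the loss per event can be checked by summing the Poisson series directly. The sketch assumes, as above, that the n-fold peak sits at exactly $nE_p$; the truncation order `n_max` is an arbitrary choice that is ample for scattering parameters of a few units.

```python
import math


def mean_energy_loss(t_over_lambda, plasmon_energy_eV, n_max=80):
    """Sum P_n * (n * E_p) over n, with P_n the Poisson fraction for n-fold scattering."""
    total = 0.0
    for n in range(n_max + 1):
        p_n = t_over_lambda ** n * math.exp(-t_over_lambda) / math.factorial(n)
        total += p_n * n * plasmon_energy_eV
    return total


if __name__ == "__main__":
    print(mean_energy_loss(0.7, 15.0), 0.7 * 15.0)
```
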


The following qualifications relate to Eq. (3.94), which is known as 
Poisson’s law: 


(1) Angles of scattering are assumed to be small, making the distances that elec- 
trons travel through the specimen almost identical for the different orders n of 
scattering. 

(2) If several energy-loss processes (each characterized by a different mean free
path $\lambda_j$) occur within the energy range over which the spectral intensity is
integrated in Eqs. (3.93) and (3.94), the number of scattering events is additive and
the effective scattering parameter is


$$t/\lambda = \sum_j t_j/\lambda_j \tag{3.96}$$


If the transmitted electron passes through several layers, $t_j$ represents the
thickness of layer $j$, whereas if several scattering processes occur within a
single-layer specimen, each $t_j$ is equal to the specimen thickness.

(3) Although Eq. (3.94) refers to bulk processes, surface-mode scattering can be
included by using the surface-loss probability $P_s$ (see Section 3.3.5) as a second
scattering parameter. For normal incidence, $P_s$ is sufficiently small (<5%)
that second- and higher order surface scattering is negligible, but for reflection
at grazing incidence $P_s$ can exceed unity and multiple surface plasmon peaks
dominate the spectrum (see Fig. 3.26). For this situation, the validity of Poisson
statistics has been confirmed experimentally (Schilling, 1976).

(4) Equation (3.94) is exact only if the specimen is of uniform thickness within
the area from which the spectrum is recorded. The effect of thinner regions can
be visualized by imagining a hole to occur within the analyzed area; electrons
passing through the hole contribute to the zero-loss intensity $I_0$ but not to other
orders of scattering. Breakdown of Eq. (3.94) then leads to inaccuracy in the
removal of plural scattering by Fourier log deconvolution, as will be discussed
in Chapter 4.

(5) The use of Poisson statistics is justified if all scattering events contribute to the
measured intensities $I_n$. However, the energy-loss spectrum is often recorded
with an angle-limiting aperture that accepts only a fraction $F_n(\beta)$ of the
electrons of a given order. In this situation, Eq. (3.94) retains its validity only if the
aperture factors obey the relationship


$$F_n(\beta) = [F_1(\beta)]^n \tag{3.97}$$


If Eq. (3.97) applies, substitution into Eq. (3.94) gives the measured intensity
as $I_n(\beta) = F_n(\beta)\,(1/n!)\,(t/\lambda)^n \exp(-t/\lambda)\,I_t = (1/n!)\,(F_1 t/\lambda)^n \exp(-t/\lambda)\,I_t
= (1/n!)\,[t/\lambda(\beta)]^n \exp(-t/\lambda)\,I_t$, where $\lambda(\beta) = \lambda/F_1(\beta)$, so that


$$\frac{I_n(\beta)}{I_t(\beta)} = \frac{(1/n!)\,[t/\lambda(\beta)]^n \exp(-t/\lambda)}{\sum_k (1/k!)\,[t/\lambda(\beta)]^k \exp(-t/\lambda)} = (1/n!)\,[t/\lambda(\beta)]^n \exp[-t/\lambda(\beta)] \tag{3.98}$$


where $I_t(\beta)$ is the total measured intensity. In this case, Poisson’s law remains
valid, provided the mean free path $\lambda$ is replaced by an aperture-dependent mean
free path $\lambda(\beta) = \lambda/F_1(\beta)$, inversely related to the integral cross section $\sigma(\beta)$
for scattering within the collection aperture:


$$\lambda(\beta) = [n_a \sigma(\beta)]^{-1} \tag{3.99}$$


where $n_a$ is the number of atoms per unit volume within the specimen. If
the angular distribution of scattering is Lorentzian with an abrupt cutoff at a
scattering angle $\theta_c$, Eq. (3.57) is valid and


$$F_1(\beta) = \frac{\lambda}{\lambda(\beta)} = \frac{\sigma(\beta)}{\sigma} = \frac{\ln(1 + \beta^2/\theta_E^2)}{\ln(1 + \theta_c^2/\theta_E^2)} \tag{3.100}$$


As a result of this logarithmic dependence on the collection semi-angle, $\lambda(\beta)$
is somewhat longer than the mean free path $\lambda$ for scattering through all angles.
However, to justify the validity of Eqs. (3.97) and (3.98), we need to examine
the angular distribution of plural scattering.
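
The aperture correction for a truncated Lorentzian can be sketched as follows. The code assumes $F_1(\beta) = \ln(1+\beta^2/\theta_E^2)/\ln(1+\theta_c^2/\theta_E^2)$ and $F_n = F_1^n$, and it clamps $\beta$ at the cutoff angle (an added assumption that nothing is scattered beyond $\theta_c$). All names and the example angles are illustrative rather than taken from the text.

```python
import math


def aperture_fraction(beta, theta_E, theta_c):
    """F_1(beta) for a Lorentzian angular distribution truncated at theta_c."""
    beta = min(beta, theta_c)  # assumption: no single scattering beyond the cutoff
    return (math.log(1.0 + (beta / theta_E) ** 2)
            / math.log(1.0 + (theta_c / theta_E) ** 2))


def effective_mean_free_path(mfp, beta, theta_E, theta_c):
    """Aperture-dependent mean free path lambda(beta) = lambda / F_1(beta)."""
    return mfp / aperture_fraction(beta, theta_E, theta_c)


def measured_order_fraction(n, t, mfp, beta, theta_E, theta_c):
    """I_n(beta) / I_t(beta), which presumes F_n = F_1**n."""
    s = t / effective_mean_free_path(mfp, beta, theta_E, theta_c)
    return s ** n * math.exp(-s) / math.factorial(n)


if __name__ == "__main__":
    # Hypothetical values: angles in mrad, lengths in nm.
    print(aperture_fraction(5.0, 0.1, 10.0))
    print(effective_mean_free_path(100.0, 5.0, 0.1, 10.0))
```

Because the dependence on $\beta$ is logarithmic, halving the aperture from the cutoff angle removes only a modest fraction of the single scattering in this example.
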


3.4.2 Angular Distribution of Plural Inelastic Scattering 


In the case of double scattering, the intensity per unit solid angle $dJ^2/d\Omega$ is a
two-dimensional convolution of the single-scattering angular distribution. Using polar
coordinates to represent the radial component $\theta$ and the azimuthal component $\varphi$ of
scattering angle, this convolution can be represented (Fig. 3.29) as


$$\frac{dJ^2(\theta)}{d\Omega} \propto \iint \frac{d\sigma(\theta_2)}{d\Omega}\,\frac{d\sigma(\theta_1)}{d\Omega}\,d\Omega_1 \tag{3.101}$$

Fig. 3.29 (a) Geometry of double scattering from O to P, showing scattering angles projected onto
a plane perpendicular to the optic axis. The individual scattering angles are $\theta_1$ and $\theta_2$; the total
angular deflection after double scattering is $\theta$. (b) Scattering per unit solid angle calculated for the
first four orders; triangles represent double scattering obtained from Eq. (3.102). (c) Scattering per
unit solid angle, calculated up to high scattering angle and assuming an abrupt cutoff at $\theta_c = 80\,\theta_E$.
(d) Fraction $F_n$ of inelastic scattering (of order $n$) collected by an aperture of semi-angle $\beta$. The
data points show $F_2$ and $F_3$ calculated using two different algorithms; dashed curves represent the
square and cube of $F_1$. From Egerton and Wang (1990), copyright Elsevier


where the integration is over all solid angles $\Omega_1$. For a Lorentzian $d\sigma/d\Omega$ with no
cutoff, the integration can be represented analytically:


dJ*/dQ « 67 "{In{l — uv + uw — u/w] 


= Inf — 2u2/v + wu2/v?)'/? + wu/v — u/w)} (3.102) 


where $u = \theta/\theta_E$, $v = 1 + \theta^2/\theta_E^2$, and $w = (4 + \theta^2/\theta_E^2)^{1/2}$. For $\theta \gg \theta_E$,
Eq. (3.101) becomes


$$dJ^2/d\Omega \propto \theta^{-2} \ln(\theta/\theta_E) \tag{3.103}$$


Therefore, at higher scattering angles the intensity falls off a little more slowly than
the $\theta^{-2}$ dependence of single scattering (Fig. 3.29c).

A truncated Lorentzian angular distribution of single scattering can be specified
by introducing a function $H(\theta)$ that changes from 1 to 0 as $\theta$ passes through $\theta_c$, so
that Eq. (3.101) becomes


$$\frac{dJ^2(\theta)}{d\Omega} \propto \int_0^{\pi}\!\!\int_0^{2\pi} \frac{H(\theta_1)}{\theta_1^2 + \theta_E^2}\,\frac{H(\theta_2)}{\theta_2^2 + \theta_E^2}\;\theta_1\,d\varphi_1\,d\theta_1 \tag{3.104}$$


where $\theta_2^2 = \theta^2 + \theta_1^2 - 2\theta\theta_1\cos\varphi_1$, applying the cosine rule to the vector triangle
in Fig. 3.29a. Equation (3.104) can be evaluated numerically, considerable computation
being needed to achieve good accuracy. Exploiting the fact that the inelastic
scattering has axial symmetry, the double integral can be replaced by a single
integral involving a Bessel function, the so-called Hankel transform (Bracewell, 1978),
with a reduction in computing time.

Extending Eq. (3.104), the intensity of $n$-fold scattering can be computed as
an $n$-fold convolution of $d\sigma/d\Omega$. Calculated angular distributions are shown in
Fig. 3.29b, c. Relative to the half-width $\theta_E$ of single scattering, the half-widths of
the double, triple, and quadruple scattering distributions are increased by factors of
2.6, 5.1, and 7.5, respectively. Since some plural scattering intensity extends as far
as an angle $n\theta_c$, the “cutoff” at $\theta = \theta_c$ becomes more gradual as $n$ increases.

To calculate the attenuation factor $F_n$, the angular distribution of $n$-fold scattering
must be integrated up to an angle $\beta$ and divided by the integral over all scattering
angles. Results for $n = 2$ and $n = 3$ are shown in Fig. 3.29d, in comparison to
the square and cube of $F_1$ (dashed curves). Although discrepancies are observable
close to $\theta_c$ and at small angles, Eq. (3.97) is found to be accurate to within 3% for
$\beta > 15\,\theta_E$ (Egerton and Wang, 1990; Su et al., 1992). Equation (3.97) has also
been inferred from the results of Monte Carlo calculations (Jouffrey et al., 1989)
and has been verified experimentally from deconvolution of plasmon-loss spectra
(see Fig. 4.2).

For double scattering ($n = 2$), Eq. (3.97) can be proved mathematically if we
take $\beta \gg \theta_E$, since if we assume $F_2 = (F_1)^2$, then


$$dJ^2/d\Omega \propto \theta^{-1}\,dJ^2/d\theta \propto \theta^{-1}\,(dF_2/d\theta) \propto \theta^{-1}\,d[\ln^2(\theta/\theta_E)]/d\theta \propto \theta^{-2} \ln(\theta/\theta_E) \tag{3.105}$$


which is of the same form as Eq. (3.103), which is valid for $\theta \gg \theta_E$. Consequently,
Eq. (3.97) appears to be a property of the $\theta^{-2}$ tail of the Lorentzian single-scattering
angular distribution.
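
The asymptotic argument can be checked numerically. The sketch below takes the unnormalised single-scattering aperture function $F_1 \propto \ln(1 + u^2)$ with $u = \theta/\theta_E$ (no cutoff), assumes $F_2 = F_1^2$, differentiates by central difference, and compares the result with $u^{-2}\ln u$. The names and step size are illustrative choices.

```python
import math


def f1_unnormalised(u):
    """Single-scattering intensity collected up to u = theta/theta_E (no cutoff)."""
    return math.log(1.0 + u * u)


def double_scattering_if_f2_is_f1_squared(u, rel_step=1e-4):
    """(1/u) d(F_1^2)/du by central difference: dJ^2/dOmega up to a constant factor."""
    h = rel_step * u
    derivative = (f1_unnormalised(u + h) ** 2 - f1_unnormalised(u - h) ** 2) / (2.0 * h)
    return derivative / u


def asymptotic_form(u):
    """The u**-2 * ln(u) dependence expected for double scattering at large angle."""
    return math.log(u) / (u * u)


def ratio(u):
    return double_scattering_if_f2_is_f1_squared(u) / asymptotic_form(u)


if __name__ == "__main__":
    for u in (10.0, 100.0, 1000.0):
        print(u, ratio(u))
```

The ratio settles to a constant at large $u$, which is what proportionality between the two forms requires.
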


3.4.3 Influence of Elastic Scattering


So far, our discussion of angular distributions and plural scattering probabilities 
has made no reference to elastic scattering, even though the probability of such 
scattering is comparable to that of inelastic scattering; see page 126. In general, the 
angular width of inelastic scattering is much less than that of elastic scattering (see 
Section 3.2) and the collection semi-angle $\beta$ used in spectroscopy is often less than
the characteristic angle $\theta_0$ of elastic scattering or (for a crystalline specimen) the
scattering angle $2\theta_B$ of the lowest order diffraction spots. Under these conditions,
the total intensity $I_t(\beta)$ in the spectrum is reduced to a value that is considerably
below the incident beam intensity $I$, particularly for thicker specimens. Although
this reduction of intensity is the most important effect of elastic scattering on the 
energy-loss spectrum, we will now consider its effect on the relative intensities of 
the different orders of inelastic scattering recorded by a spectrometer. 

In an amorphous material, elastic and inelastic scattering are independent and 
both are governed by Poisson statistics, so the joint probability of m elastic and n 
inelastic events is 


$$P(m, n) = (x_e/m!)\,(t/\lambda_e)^m\,(x_i/n!)\,(t/\lambda_i)^n \tag{3.106}$$


where $x_e = \exp(-t/\lambda_e)$ and $x_i = \exp(-t/\lambda_i)$; $\lambda_e$ and $\lambda_i$ are the mean free paths
for elastic and inelastic scattering through all angles. If a fraction $F(m, n)$ of the
electrons passes through a collection aperture of semi-angle $\beta$, the recorded zero-loss
component $I_0(\beta)$ for an incident beam intensity $I$ is given by


$$I_0(\beta)/I = \sum_{m=0}^{\infty} P(m, 0)\,F(m, 0) = x_e x_i + x_e x_i \sum_{m=1}^{\infty} (1/m!)\,(t/\lambda_e)^m F_m^e(\beta) \tag{3.107}$$


Here, $x_e x_i$ represents the unscattered electrons and $F_m^e(\beta)$ is the fraction of $m$-fold
purely elastic scattering that passes through the aperture. Calculations (Wong and
Egerton, 1995) based on the Lenz model suggest that the $m$-fold elastic scattering
distribution approximates to a broadened single-scattering angular distribution with
$\theta_0$ replaced by $(0.7 + 0.5m)\,\theta_0$, so that Eq. (3.6) becomes, for $m > 1$,


$$F_m^e(\beta) \approx [1 + (0.7 + 0.5m)^2\,\theta_0^2/\beta^2]^{-1} \tag{3.108}$$


The inelastic intensity $I_i(\beta)$ transmitted through the aperture, integrated over all
orders of scattering, is given by


$$I_i(\beta)/I = \sum_{m=0}^{\infty} \sum_{n=1}^{\infty} P(m, n)\,F(m, n) = x_e x_i \sum_{n=1}^{\infty} (1/n!)\,(t/\lambda_i)^n F_n^i(\beta) + \sum_{m=1}^{\infty} \sum_{n=1}^{\infty} P(m, n)\,F(m, n) \tag{3.109}$$

Here, $F_n^i(\beta)$ is the fraction of inelastic scattering that passes through the aperture,
previously denoted $F_n(\beta)$, and the final term in Eq. (3.109) represents electrons that have
been scattered both elastically and inelastically. Calculations (Wong and Egerton,
1995) of the angular distribution of this “mixed” scattering, based on Lenz model
single-scattering distributions, indicate that the corresponding aperture function can
be approximated by a simple product:


$$F(m, n) \approx F_m^e(\beta)\,F_n^i(\beta) \tag{3.110}$$


Most likely, Eq. (3.110) results from the fact that the elastic and inelastic scattering
have very different angular widths. Now Eq. (3.109) can be rewritten, making use
of Eqs. (3.107) and (3.97), as


$$I_i(\beta)/I = x_e x_i \sum_{n=1}^{\infty} (1/n!)\,(t/\lambda_i)^n F_n^i(\beta) \left[ 1 + \sum_{m=1}^{\infty} (1/m!)\,(t/\lambda_e)^m F_m^e(\beta) \right] = [I_0(\beta)/I]\,[\exp(F_1^i\,t/\lambda_i) - 1] \tag{3.111}$$


Writing the total intensity recorded through the aperture as $I_t(\beta) = I_0(\beta) + I_i(\beta)$ and
defining $t/\lambda_i(\beta) = F_1^i\,(t/\lambda_i)$ as before, Eq. (3.111) becomes


$$t/\lambda_i(\beta) = \ln[I_t(\beta)/I_0(\beta)] \tag{3.112}$$


which is the same as Eq. (3.95), derived previously without considering elastic
scattering. This equation, which is used for measuring specimen thickness (Section 5.1),
can therefore be justified mathematically on the basis of angular distributions (of
elastic and inelastic scattering) that are found to be good approximations for
amorphous specimens (Wong and Egerton, 1995). Experimentally, Eq. (3.112) has been
verified to within 10% for $t/\lambda_i < 5$ in amorphous, polycrystalline, and single-crystal
specimens (Hosoi et al., 1981; Leapman et al., 1984a).
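
The cancellation of the elastic factor in the log-ratio can be seen by summing the series directly. The sketch below assumes the product rule $F(m,n) = F_m^e (F_1^i)^n$ and, as a simplification, applies the broadened-Lenz aperture fraction $[1 + (0.7 + 0.5m)^2\theta_0^2/\beta^2]^{-1}$ to every $m \geq 1$. Function names, the truncation order and the example numbers are illustrative.

```python
import math


def elastic_aperture_fraction(m, beta, theta_0):
    """Fraction of m-fold elastic scattering inside the aperture (broadened Lenz form)."""
    return 1.0 / (1.0 + ((0.7 + 0.5 * m) * theta_0 / beta) ** 2)


def recorded_intensities(t_over_le, t_over_li, f1_inelastic, beta, theta_0, n_max=60):
    """Return (I_0(beta)/I, I_i(beta)/I) by direct summation over orders m and n."""
    x_e = math.exp(-t_over_le)
    x_i = math.exp(-t_over_li)
    # Unscattered beam plus purely elastic scattering that stays inside the aperture.
    elastic_sum = 1.0
    for m in range(1, n_max + 1):
        elastic_sum += (t_over_le ** m / math.factorial(m)
                        * elastic_aperture_fraction(m, beta, theta_0))
    zero_loss = x_e * x_i * elastic_sum
    # Inelastic orders n >= 1, each weighted by (F_1^i)**n and the same elastic factor.
    inelastic_sum = 0.0
    for n in range(1, n_max + 1):
        inelastic_sum += (t_over_li * f1_inelastic) ** n / math.factorial(n)
    return zero_loss, x_e * x_i * elastic_sum * inelastic_sum


def log_ratio(t_over_le, t_over_li, f1_inelastic, beta, theta_0):
    """ln(I_t(beta) / I_0(beta)), to be compared with F_1^i * t / lambda_i."""
    zero_loss, inelastic = recorded_intensities(
        t_over_le, t_over_li, f1_inelastic, beta, theta_0)
    return math.log((zero_loss + inelastic) / zero_loss)


if __name__ == "__main__":
    # Hypothetical case: t/lambda_e = 1.5, t/lambda_i = 1.2, F_1^i = 0.6, angles in mrad.
    print(log_ratio(1.5, 1.2, 0.6, 10.0, 20.0), 0.6 * 1.2)
```

Elastic scattering lowers both recorded intensities by the same factor, so the log-ratio returns $F_1^i\,t/\lambda_i$ whatever elastic parameters are chosen.
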

Elastic scattering has a greater influence on the recorded intensity of inner-shell 
inelastic scattering (whose angular width is often comparable to that of elastic scat- 
tering), as discussed in Section 4.3.2.2. In crystalline specimens, additional effects 
occur as a result of channeling; see Section 3.1.4. 


3.4.4 Multiple Scattering 


For relatively thick specimens ($t/\lambda > 5$), individual peaks may not be visible in
the loss spectrum; multiple outer- and inner-shell processes combine to produce a
Landau distribution (Whelan, 1976; Reimer, 1989) that is broadly peaked around
an energy loss of some hundreds of electron volts; see Fig. 3.30. The position of
the maximum (the most probable energy loss) is roughly proportional to specimen
thickness (Perez et al., 1977; Whitlock and Sprague, 1982) and is very approximately
$(t/\lambda)E_p$, where $E_p$ is the energy of the main peak in the single-scattering
distribution. This multiple scattering behavior is sometimes called straggling.

When the number $n$ of events is large, the angular distribution of scattering tends
toward a Gaussian function, a consequence of the central limit theorem (Jackson,
1975). The mean-square angular deflection is $n\langle\theta^2\rangle$, where $\langle\theta^2\rangle$ is the mean-square
angle for single scattering, given by Eq. (3.54) for the case of a truncated Lorentzian
angular distribution of single scattering.


3.4.5 Coherent Double-Plasmon Excitation 


A fast electron traveling through a solid can in principle lose a characteristic amount
of energy equal to $2\hbar\omega_p$ in a single scattering event. In accordance with Eq. (3.94),
the intensity at an energy loss $E = 2\hbar\omega_p$ should therefore be


$$I_{2p} = I_0\,[(t/\lambda_p)^2/2 + t/\lambda_{2p}] \tag{3.113}$$


where the first term represents the incoherent production of two plasmons in separate
scattering events (in accordance with Poisson’s law) and the second term
represents coherent double-plasmon excitation, characterized by a mean free path
$\lambda_{2p}$. As seen from Eq. (3.113), the coherent contribution is fractionally greater in the
case of very thin specimens. It should be visible directly if the energy-loss spectrum
is deconvolved to remove the incoherent plural scattering.
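
How strongly the coherent share depends on thickness can be seen from a small sketch. It writes the coherent term as $(t/\lambda_p)\,p$, where $p = \lambda_p/\lambda_{2p}$ is the ratio of the two mean free paths; the function names and the value $p = 0.04$ used in the example are illustrative.

```python
def double_plasmon_terms(t_over_lambda_p, p_rel, zero_loss=1.0):
    """Return (incoherent, coherent) intensity at an energy loss of two plasmon quanta."""
    incoherent = zero_loss * t_over_lambda_p ** 2 / 2.0
    # t / lambda_2p = (t / lambda_p) * (lambda_p / lambda_2p) = (t / lambda_p) * p_rel
    coherent = zero_loss * t_over_lambda_p * p_rel
    return incoherent, coherent


def coherent_share(t_over_lambda_p, p_rel):
    """Fraction of the double-plasmon intensity that is due to the coherent process."""
    incoherent, coherent = double_plasmon_terms(t_over_lambda_p, p_rel)
    return coherent / (incoherent + coherent)


if __name__ == "__main__":
    for s in (0.05, 0.1, 0.5, 1.0):
        print(s, round(coherent_share(s, 0.04), 3))
```

The share equals $2p/(2p + t/\lambda_p)$, so it approaches unity only when $t/\lambda_p$ is small compared with $2p$.
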

Based on a free-electron model, Ashley and Ritchie (1970) deduced that the relative
probability $P_{\rm rel}$ of the double process is proportional to the fifth power of the
cutoff wavevector $q_c$. Taking $q_c \approx \omega_p/v_F$, their formula becomes


$$P_{\rm rel} = \lambda_p/\lambda_{2p} \approx 0.013\,r_s^2 \tag{3.114}$$


where $r_s$ is the radius of a sphere containing one free electron, divided by the Bohr
radius $a_0$. The lowest free-electron density (corresponding to $r_s = 5.7$) occurs
in cesium, giving $P_{\rm rel} = 0.34$. For aluminum, $r_s = 2.0$ and Eq. (3.114) yields
$P_{\rm rel} = 0.04$; by calculating a many-body Hamiltonian, Srivastava et al. (1982)
obtained $P_{\rm rel} = 0.024$.

Experimental determinations for aluminum have produced disparate values:
0.135 (Spence and Spargo, 1971), 0.07 (Batson and Silcox, 1983), <0.03 (Egerton
and Wang, 1990), and <0.005 (Schattschneider and Pongratz, 1988). The reason
for these discrepancies is unknown, although Schattschneider has pointed out that
small holes in the specimen or a variation in thickness would cause an overestimate
of $P_{\rm rel}$ after Fourier log deconvolution. Although the double-plasmon process is of
interest in terms of nonlinear physics and plasmon-electron coupling, its apparent
low probability suggests that it can usually be neglected in quantitative analysis of
the low-loss spectrum.


3.5 The Spectral Background to Inner-Shell Edges 


Each ionization edge in the energy-loss spectrum is superimposed on a downward- 
sloping background that arises from the excitation of electrons of lower binding 
energy, and which may have to be subtracted in the process of elemental analysis or 
interpretation of core-loss fine structure. Since the background is often comparable 
to or larger than the core-loss intensity, accurate subtraction is essential, as dis- 
cussed in Chapter 4. In general, it is desirable to minimize the background intensity, 
requiring an understanding of the energy-loss mechanisms that give rise to the back- 
ground. In this section, we discuss contributions to the background in the case of a 
very thin specimen, then consider the effect of plural scattering, which is important 
for thicker specimens. 


3.5.1 Valence-Electron Scattering 


For energy losses below 50 eV, inelastic scattering from outer-shell electrons is 
largely a collective process in the majority of solids. A “plasmon” peak usually 
occurs in the range of 10-30 eV, above which the intensity falls monotonically with 
increasing energy loss. Integration of Eq. (3.32) up to a collection semi-angle $\beta$
large compared to the characteristic angle $\theta_E$ gives


$$d\sigma/dE \propto \mathrm{Im}(-1/\varepsilon)\,\ln(\beta/\theta_E) \tag{3.115}$$


The Drude expression for $\mathrm{Im}(-1/\varepsilon)$, Eq. (3.43), is proportional to $E^{-3}$ for large $E$,
so the falloff of intensity within the plasmon “tail” should vary roughly as $E^{-3}$, if
we ignore the logarithm term in Eq. (3.115).
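
The power-law tail of the Drude loss function can be confirmed numerically. The sketch assumes the standard Drude form $\mathrm{Im}(-1/\varepsilon) = E\,\Gamma E_p^2/[(E^2 - E_p^2)^2 + (E\Gamma)^2]$, with the plasmon energy $E_p$ and the damping width $\Gamma$ both expressed in eV; the parameter values are hypothetical.

```python
import math


def drude_loss_function(E, Ep, gamma):
    """Im(-1/eps) for a Drude metal; E, Ep and gamma are all in eV."""
    return E * gamma * Ep ** 2 / ((E ** 2 - Ep ** 2) ** 2 + (E * gamma) ** 2)


def local_exponent(E, Ep, gamma, step=1.01):
    """Logarithmic slope d ln(Im(-1/eps)) / d ln(E), estimated over a 1% step in E."""
    upper = drude_loss_function(E * step, Ep, gamma)
    lower = drude_loss_function(E, Ep, gamma)
    return math.log(upper / lower) / math.log(step)


if __name__ == "__main__":
    for E in (50.0, 100.0, 300.0, 1000.0):
        print(E, round(local_exponent(E, 15.0, 1.0), 3))
```

The slope is steeper than -3 just above the plasmon peak and tends to -3 as $E$ becomes large compared with $E_p$.
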

At large energy loss, however, the scattering is likely to have a single-electron 
character (see Section 3.3.2) and is more appropriately described by Bethe the- 
ory, using the equations that will be applied to inner-shell excitation in Section 
3.6. In particular, do/dE is expected to have a power-law energy dependence, as 
in Eq. (3.154), with an exponent of the order of 4 or 5 for small values of $\beta$. Such
behavior was confirmed by measurements of the valence-electron scattering from
thin films of carbon, in the energy range of 100-280 eV (Egerton, 1975; Maher 
et al., 1979). 

Since the binding energy of a valence electron is small compared to the energy 
losses under consideration, a large proportion of the background intensity will occur 
at large scattering angles, in the form of a Bethe ridge (Fig. 3.36). Again, experimen- 
tal data on carbon support this; see Fig. 3.31. In contrast, the inner-shell electrons 
have large binding energies and their inelastic scattering is forward peaked with 
an approximately Lorentzian angular distribution. Therefore, for the same energy 
loss, the core-loss intensity is concentrated into smaller scattering angles than the 
background. Using a small collection aperture to record the energy-loss spectrum 
therefore enhances the edge/background ratio, as seen in Fig. 3.32a. However, 
this small aperture results in a weak core-loss signal, giving rise to a relatively 
large shot-noise component in the spectral data and a low signal/noise ratio (SNR). 
Consequently, the SNR is often optimum at some intermediate value of collection 
semi-angle, typically around 10 mrad; see Fig. 3.32b. 


3.5.2 Tails of Core-Loss Edges 


In addition to the valence electrons, inner-shell electrons of lower binding energy
may contribute intensity to the background underlying an ionization edge. If the
preceding edge is prominent and not much lower in binding energy, the angular
distribution of the background will be forward peaked and of comparable width to
that of the edge being analyzed. The advantage of a small collection angle (in terms
of signal/background ratio) is then less than when the background arises mainly
from valence-electron excitation. But if the two edges are well separated in energy,
the angular distributions may be sufficiently dissimilar to allow a significant increase
in signal/background ratio with decreasing collection angle, as in the case of the
oxygen K-edge in B2O3 (Fig. 3.32a).

Fig. 3.32 (a) Signal/background ratio ($I_k/I_b$) and (b) signal/noise ratio (SNR) as a function of
collection semi-angle, measured for the boron and oxygen K-edges of B2O3 (Egerton et al., 1976).
The parameter $h$ is a factor that occurs in the formula for SNR (see Section 4.4.4) and depends on
the widths of the background-fitting and extrapolation regions


3.5.3 Bremsstrahlung Energy Losses 


When a transmitted electron undergoes centripetal acceleration in the nuclear field 
of an atom, it loses energy in the form of electromagnetic radiation (Bremsstrahlen). 
Although “coherent bremsstrahlung” peaks can be recorded from crystalline speci- 
mens in certain circumstances (Spence et al., 1983; Reese et al., 1984), the energy 
spectrum of the emitted photons usually forms a continuous background to the 
characteristic peaks observed in an x-ray emission spectrum. 

The differential cross section for bremsstrahlung scattering into angles less than
$\beta$ can be written as (Rossouw and Whelan, 1979)


$$d\sigma/dE = C\,E^{-1} Z^2 (v/c)^{-2} \ln[1 + (\beta/\theta_E)^2] \tag{3.116}$$


where $C = 1.55 \times 10^{-31}$ m$^2$ per atom. Equation (3.116) neglects screening of the
nuclear field by the atomic electrons, but is sufficient to show that (for energy losses
below 4 keV) the bremsstrahlung background in the energy-loss spectrum is small
in comparison with that arising from electronic excitation (Isaacson and Johnson,
1975; Rossouw and Whelan, 1979).


3.5.4 Plural-Scattering Contributions to the Background 


Within the low-loss region (E < 100 eV), plural scattering contributes significant 
intensity unless the specimen thickness is much less than the plasmon mean free 
path (of the order of 100 nm for 100-keV electrons; see Section 3.3.1). At an energy 
loss of several hundred electron volts, however, multiple scattering that involves 
only plasmon events makes a negligible contribution, since the required number n 
of scattering events is large and the probability $P_n$ becomes vanishingly small as a
result of the $n!$ denominator in Eq. (3.94). For example, a multiple plasmon loss of
$10E_p$ requires (on the average) 10 successive scattering events, giving $P_n < 10^{-6}$
for a sample thickness equal to the plasmon mean free path.

Similarly, it can be shown that the probability of two or more inelastic events of
comparable energy loss is negligible when the total loss is greater than 100 eV. For
example, if the single-scattering probability $P(E)$ is of the form $AE^{-r}$, the probability
of two similar events (each of energy loss $E/2$) is $2^{2r}[P(E)]^2$, which at high
energy loss is small compared to $P(E)$ because of the rapid falloff in the differential
cross section.

However, the probability of two dissimilar energy losses can be appreciable, 
as illustrated by the following simplified model (Stephens, 1980). The low-loss 
spectrum is represented by a series of sharp ($\delta$-function) peaks at multiples of
the plasmon energy $E_p$, the area under each being given by Poisson statistics. The
energy dependence of the single-scattering background (arising from inner-shell or
valence single-electron excitation) is taken to be $J^1(E) = AE^{-r}$, with $A$ and $r$ as
constants. Provided that scattering events are independent, the joint probability of
several events is the product of the individual probabilities. Therefore the intensity
at an energy loss $E$, due to one single-electron and $n$ plasmon events, is


$$J^{1+n}(E) = A\,(E - nE_p)^{-r}\,(t/\lambda)^n \exp(-t/\lambda)/n! \tag{3.117}$$


where $\lambda$ is the plasmon mean free path. This equation allows the contributions
from different orders of scattering to be compared for different values of $t/\lambda$;
see Fig. 3.33 and Appendix B, Section B.10.
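
A minimal sketch of this model is given below. Each order is the power-law background $A(E - nE_p)^{-r}$ weighted by the Poisson probability of $n$ plasmon events. The cutoff `E_min`, below which the power law is no longer trusted, is a hypothetical parameter added here to keep the sum finite, and the numerical values are illustrative.

```python
import math


def order_contribution(E, n, A, r, Ep, t_over_lambda):
    """One single-electron event of loss E - n*Ep combined with n plasmon events."""
    remaining = E - n * Ep
    if remaining <= 0.0:
        return 0.0  # the single-electron event cannot carry a non-positive loss
    poisson = t_over_lambda ** n * math.exp(-t_over_lambda) / math.factorial(n)
    return A * remaining ** (-r) * poisson


def background(E, A, r, Ep, t_over_lambda, E_min=50.0):
    """Sum the orders for which E - n*Ep stays above E_min (hypothetical validity limit)."""
    total, n = 0.0, 0
    while E - n * Ep >= E_min:
        total += order_contribution(E, n, A, r, Ep, t_over_lambda)
        n += 1
    return total


if __name__ == "__main__":
    # Hypothetical parameters: E = 250 eV, Ep = 25 eV, r = 4, A = 1 (arbitrary units).
    for s in (0.5, 2.0):
        print(s, [order_contribution(250.0, n, 1.0, 4.0, 25.0, s) for n in range(4)])
```

Relative to the $n = 0$ term, the first plural-scattering order is larger by the factor $[E/(E - E_p)]^r\,(t/\lambda)$, so it becomes comparable to the single-scattering background once $t/\lambda$ approaches unity.
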

Plural scattering contributions to an ionization edge can be evaluated in a similar 
way (Fig. 3.33). Since the double (core-loss + plasmon) scattering is delayed until
an energy loss $E = E_k + E_p$, the core-loss intensity just above the threshold $E_k$
represents only single core-loss scattering. Defining the jump ratio (JR) of the edge
as the height of the initial rise divided by the intensity of the immediately preceding 
background, JR is seen to decrease with increasing specimen thickness, because of 
plural scattering contributions to the background; see Fig. 3.33. 

An alternative measure of edge visibility is the signal/background ratio (SBR),
measured as core-loss intensity integrated over an energy range Δ above the
threshold, divided by the background integrated over the same energy range. SBR
decreases with thickness less rapidly than JR because some plural scattering is
included in the core-loss integral, and for sufficiently large Δ, SBR should vary
little with specimen thickness. In practice, plural scattering makes background
extrapolation and measurement of the core-loss integral more difficult in thicker
specimens.

Fig. 3.33 Contribution of plasmon scattering (up to order n) to the carbon K-edge (solid lines) and
to its background (dashed curve), calculated using Eq. (3.117). In this simple model, the contribution
of each successive order to the edge profile is visible as a sharp step in intensity; in practice,
these steps are rounded and in some materials barely visible. The specimen thickness is (a) one
half and (b) twice the total-inelastic mean free path

Despite the approximations involved, Eq. (3.117) agrees quite well with mea- 
surements of SBR for low-energy ionization edges; see Fig. 3.34a. In the case of 
higher energy edges (Fig. 3.34b), the agreement can be less satisfactory because of 
spectrometer contributions to the background (Section 2.4.1). Although extremely 
thin specimens give the highest signal/background ratio, the core-loss signal itself is 
very weak, resulting in a high fractional noise content. As a result, the signal/noise 
ratio (SNR) is optimum at some intermediate thickness, as illustrated in Fig. 3.34b. 
SNR determines the visibility of an edge and the minimum detectable concentration 
in elemental analysis; see Section 5.5.4. 

A more general method of computing plural scattering contributions is by
self-convolution of the single-scattering energy distribution. Using this approach,
Leapman and Swyt (1983) showed that plural scattering can cause the background
exponent r to decrease with increasing sample thickness; see Fig. 3.35. The change
in r is particularly large for energy losses just above a major ionization edge (e.g., in
the range of 300-400 eV for a carbonaceous sample) and is attributable to a change
in overall shape of the major edge as a result of “mixed” scattering (see Section
3.7.3). The pre-edge background and jump ratio can also be calculated by Monte
Carlo methods (Jouffrey et al., 1985), allowing the precise effects of a collection
aperture to be included.

Fig. 3.34 Signal/background ratio (SBR) and signal/noise ratio (SNR) for K-edges of (a) elemental
carbon and (b) pure silicon, as a function of specimen thickness. Solid curves represent Eq.
(3.117) and squares are measurements for 10-mrad collection semi-angle, 1-keV incident energy,
and 100-eV integration windows. SNR (in arbitrary units) was calculated as $I_k/(I_k + hI_b)^{1/2}$, as
discussed in Section 4.4.3

Fig. 3.35 Change in the slope parameter r that characterizes the energy dependence ($AE^{-r}$) of a
carbon loss spectrum, calculated as a function of specimen thickness t (Leapman and Swyt, 1983).
The four chosen values of energy loss would immediately precede minor edges due to calcium,
nitrogen, and oxygen in the spectrum of a biological specimen. The slope parameter is assumed
to be constant and equal to 3 in the absence of plural scattering (t = 0). From Leapman and Swyt
(1983), with permission from San Francisco Press


3.6 Atomic Theory of Inner-Shell Excitation 


Inner-shell electrons have relatively large binding energies and the associated energy 
losses are typically some hundreds of electron volts, corresponding to the x-ray 
region of the electromagnetic spectrum. As a result of this strong binding to the 
nucleus, collective effects are relatively unimportant. Inner-shell excitation can 
therefore be described to a first approximation in terms of single-atom models. 


3.6.1 Generalized Oscillator Strength 


The key quantity in Bethe theory (Section 3.2.2) is the generalized oscillator 
strength (GOS) that describes the response of an atom when a given energy and 
momentum are supplied from an external source (e.g., through collision of a fast 
electron). In order to calculate the GOS, it is necessary to know the initial- and final- 
state wavefunctions of the inner-shell electron. Calculations are based on several 
different methods. 


3.6.1.1 The Hydrogenic Model 


The simplest way of estimating the GOS, and the first to be developed (Bethe, 
1930), is based on wave mechanics of the hydrogen atom. This approach is of 
interest in energy-loss spectroscopy because it provides realistic values of K-shell 
ionization cross sections with a minimum amount of computing, enabling cross 
sections needed for quantitative elemental analysis to be calculated online. This 
relative simplicity arises from the fact that analytical expressions are available 
for the wavefunctions of the hydrogen atom, obtained by solving the Schrödinger
equation:

$$-(\hbar^2/2m_0)\nabla^2\psi - (e^2/4\pi\varepsilon_0 r)\psi = E_t\psi \tag{3.118}$$

where $E_t$ is the “net” (kinetic + electrostatic) energy of the atomic electron.

To make Eq. (3.118) applicable to an inner-shell electron within an atom of 
atomic number Z, the electrostatic term must be modified to take into account 
the actual nuclear charge Ze and screening of the nuclear field by the remaining 
(Z − 1) electrons. Following Slater (1930), an effective nuclear charge $Z_s e$ is used
in the Schrödinger equation. In the case of K-shell excitation, the second 1s electron
screens the nucleus and reduces its effective charge by approximately 0.3e,
giving $Z_s = Z - 0.3$. For L-shell excitation, $Z_s = Z - (2 \times 0.85) - (7 \times 0.35)$,
allowing for the screening effect of the two K-shell and seven remaining L-shell
electrons. Outer electrons (all those whose principal quantum number is higher than
that of the initial-state wavefunction) are assumed to form a spherical shell of charge
whose effect is to reduce the inner-shell binding energy by an amount $E_s$, so that the
observed threshold energy for inner-shell ionization is


$$E_k = Z_s^2 R - E_s \tag{3.119}$$

where R = 13.6 eV is the Rydberg energy. The Schrödinger equation of the atom is
therefore

$$-(\hbar^2/2m_0)\nabla^2\psi - (Z_s e^2/4\pi\varepsilon_0 r)\psi + E_s\psi = E_t\psi \tag{3.120}$$

The net energy $E_t$ of the excited electron is related to its binding energy $E_k$ and the
energy E lost by the transmitted electron:

$$E_t = E - E_k \tag{3.121}$$

Substituting Eqs. (3.119) and (3.121) into Eq. (3.120) gives

$$-(\hbar^2/2m_0)\nabla^2\psi - (Z_s e^2/4\pi\varepsilon_0 r)\psi = (E - Z_s^2 R)\psi \tag{3.122}$$


This is the Schrödinger equation for a “hydrogenic equivalent” atom with nuclear
charge $Z_s e$ and no outer shells. Since the wavefunctions remain hydrogenic (or
“Coulombic”) in form, standard methods can be used to solve for the wavefunction
ψ and for the GOS.
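
The screening rules and Eq. (3.119) can be checked with a few lines of Python. This is a minimal sketch: the function names are hypothetical, only the K- and L-shell rules quoted above are implemented, and the carbon K threshold of 284 eV is the value given in the caption of Fig. 3.40.

```python
RYDBERG_EV = 13.6  # Rydberg energy R in eV, as used in the text

def effective_charge(Z, shell):
    """Slater effective nuclear charge Z_s (in units of e) for K or L excitation."""
    if shell == "K":
        return Z - 0.3                      # screening by the second 1s electron
    if shell == "L":
        return Z - 2 * 0.85 - 7 * 0.35      # two K-shell + seven other L-shell electrons
    raise ValueError("only 'K' and 'L' are covered in this sketch")

def outer_shell_shift(Z, shell, E_k):
    """Outer-shell screening energy E_s = Z_s^2 R - E_k, rearranged from Eq. (3.119)."""
    return effective_charge(Z, shell) ** 2 * RYDBERG_EV - E_k

# Carbon K-shell: Z = 6, observed threshold E_k = 284 eV
print(effective_charge(6, "K"))             # Z_s
print(outer_shell_shift(6, "K", 284.0))     # E_s in eV
```

For carbon the hydrogenic threshold $Z_s^2 R$ lies well above the observed edge, so energy losses between $E_k$ and $Z_s^2 R$ fall in the range where the hydrogenic equivalent atom has only discrete final states.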

For convenience of notation, we can define dimensionless variables $Q'$ and $k_H$
that are related (respectively) to the scattering vector q and the energy loss E of the
fast electron:

$$Q' = (qa_0/Z_s)^2 \tag{3.123}$$

$$k_H^2 = E/(Z_s^2 R) - 1 \tag{3.124}$$


The generalized oscillator strength (GOS) per atom is then given, for $E > Z_s^2 R$ and
for K-shell ionization (Bethe, 1930; Madison and Merzbacher, 1975), by

$$\frac{df_K}{dE} = \frac{256E\,(Q' + k_H^2/3 + 1/3)\exp(-2\beta'/k_H)}{Z_s^4 R^2\,[(Q' - k_H^2 + 1)^2 + 4k_H^2]^3\,[1 - \exp(-2\pi/k_H)]} \tag{3.125}$$

where β′ is the value of $\arctan[2k_H/(Q' - k_H^2 + 1)]$ that lies within the range 0 to π.
Energy losses in the range $E_k < E < Z_s^2 R$ correspond to transitions to discrete states
in the hydrogenic equivalent atom and an imaginary value of $k_H$. Substitution of this
imaginary value in Eq. (3.125) gives (Egerton, 1979)
$$\frac{df_K}{dE} = \frac{256E\,(Q' + k_H^2/3 + 1/3)\exp(y)}{Z_s^4 R^2\,[(Q' - k_H^2 + 1)^2 + 4k_H^2]^3} \tag{3.126}$$

where

$$y = -(-k_H^2)^{-1/2}\,\log_e\left[\frac{Q' + 1 - k_H^2 + 2(-k_H^2)^{1/2}}{Q' + 1 - k_H^2 - 2(-k_H^2)^{1/2}}\right] \tag{3.127}$$


Corresponding hydrogenic formulas have been derived for the L-shell GOS (Walske, 
1956; Choi et al., 1973) and for M-shell ionization (Choi, 1973). 
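
The K-shell formulas, Eqs. (3.123)-(3.127), translate almost line for line into code. The Python sketch below is an illustration under stated assumptions: the function name is hypothetical, R is taken as 13.6 eV, the arguments are the energy loss E (eV), the dimensionless quantity $(qa_0)^2$, and the atomic number Z, and no cutoff is applied below the observed threshold $E_k$ (where the GOS should be set to zero by the caller).

```python
import math

RYDBERG_EV = 13.6  # Rydberg energy R in eV, as used in the text

def gos_k_hydrogenic(E, qa0_sq, Z):
    """Hydrogenic K-shell GOS df_K/dE per atom (eV^-1), Eqs. (3.125)-(3.127)."""
    Zs = Z - 0.3                                   # Slater screening for the K shell
    Qp = qa0_sq / Zs**2                            # Q', Eq. (3.123)
    kh2 = E / (Zs**2 * RYDBERG_EV) - 1.0           # k_H^2, Eq. (3.124)
    a = math.sqrt(abs(kh2))                        # |k_H|
    base = Qp - kh2 + 1.0
    if kh2 > 0.0:                                  # continuum states, Eq. (3.125)
        beta_p = math.atan2(2.0 * a, base)         # arctan branch lying in (0, pi)
        numerator = math.exp(-2.0 * beta_p / a)
        denominator = 1.0 - math.exp(-2.0 * math.pi / a)
    elif kh2 < 0.0:                                # imaginary k_H, Eqs. (3.126)-(3.127)
        y = -(1.0 / a) * math.log((base + 2.0 * a) / (base - 2.0 * a))
        numerator = math.exp(y)
        denominator = 1.0
    else:                                          # common limit of both branches
        numerator = math.exp(-4.0 / (Qp + 1.0))
        denominator = 1.0
    bracket = (base**2 + 4.0 * kh2) ** 3
    prefactor = 256.0 * E * (Qp + kh2 / 3.0 + 1.0 / 3.0)
    return prefactor * numerator / (Zs**4 * RYDBERG_EV**2 * bracket * denominator)

# Carbon K-shell near the dipole limit, (q a0)^2 = 0.01, at E = 300 eV
print(gos_k_hydrogenic(300.0, 0.01, 6))
```

Using `atan2` selects the branch of the arctangent between 0 and π directly, which matters at large $k_H$ where $Q' - k_H^2 + 1$ becomes negative. The two branches join smoothly at $E = Z_s^2 R$, which is a convenient check on any implementation.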


3.6.1.2 Hartree-Slater Method


Accurate wavefunctions have been computed for most atoms by iterative solution
of the Schrödinger equation with a self-consistent atomic potential. The Hartree-Slater
(HS or HFS) method represents a simplification of the Hartree-Fock (HF)
procedure, by assuming a central (spherically symmetric) field within the atom. The
resulting wavefunctions are close to those obtained using the HF method but require
much less computing. The radial component $\phi_0(r)$ of the ground-state wavefunction
has been tabulated by Herman and Skillman (1963). For calculation of the GOS, the
final-state radial function $\phi_\varepsilon$ is obtained by solving the radial Schrödinger equation
for a net (continuum) energy ε:

$$\left[\frac{\hbar^2}{2m_0}\frac{d^2}{dr^2} - V(r) - \frac{l'(l'+1)\hbar^2}{2m_0 r^2} + \varepsilon\right]\phi_\varepsilon(r) = 0 \tag{3.128}$$

where l′ is the angular momentum quantum number of the final (continuum) state.

Using a central-field model, it is not possible to provide an exact treatment of 
electron exchange, but an approximate correction can be made by assuming an 
exchange potential of the form (Slater, 1951) 


$$V_x = -6[(3/8\pi)\rho(r)]^{1/3} \tag{3.129}$$


where p(r) is the spherically averaged charge density within the atom. 
The transition matrix element, defined by Eq. (3.23), can be written (Manson, 
1972) as 


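$$|\varepsilon_{nl}|^2 = \sum_{l'}(2l'+1)\sum_{\lambda}(2\lambda+1)\left|\int_0^\infty \phi_0(r)\,J_\lambda(qr)\,\phi_\varepsilon(r)\,dr\right|^2 \begin{pmatrix} l' & \lambda & l \\ 0 & 0 & 0 \end{pmatrix}^2 \tag{3.130}$$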
t 0 


where the operator exp(iq · r) has been expanded in terms of spherical Bessel functions
$J_\lambda(qr)$ and the integration over angular coordinates is represented by a Wigner
(3-j) matrix. The GOS is obtained from Eq. (3.24), summing over all important
partial waves corresponding to different values of l′. Such calculations have been
carried out by McGuire (1971), Manson (1972), Scofield (1978), Leapman et al.
(1980), and Rez (1982, 1989); the results will be discussed later in this section and
in Section 3.7.1.


3.6.1.3 E- and q-Dependence of the GOS 


The generalized oscillator strength is a function of both the energy E and the
momentum ħq supplied to the atom and is conveniently portrayed as a two-dimensional
plot known as a Bethe surface, an example of which is shown in
Fig. 3.36. The individual curves in this figure represent qualitatively the angular
dependence of inner-shell scattering, since the double-differential cross section
$d^2\sigma/d\Omega\,dE$ is proportional to $E^{-1}q^{-2}\,df/dE$, as in Eq. (3.26), while $q^2$ increases
approximately with the square of the scattering angle, as in Eq. (3.27). For an energy
loss not much larger than the inner-shell binding energy $E_k$, the angular distribution
is forward peaked (maximum intensity at θ = 0, $q = q_{min} \approx k_0\theta_E$) and corresponds
to the dipole region of scattering. On a particle model, this low-angle scattering
represents “soft” collisions with relatively large impact parameter.

Fig. 3.36 Bethe surface for K-shell ionization of carbon, calculated using a hydrogenic model.
The generalized oscillator strength is zero for energy loss E below the ionization threshold $E_k$.
The horizontal coordinate is related to scattering angle. From Egerton (1979), copyright Elsevier

At large energy loss, the scattering becomes concentrated into a Bethe ridge 
(Fig. 3.36) centered around a value of q that satisfies 


$$(qa_0)^2 = E/R + E^2/(2m_0c^2R) \approx E/R \tag{3.131}$$

for which the equivalent scattering angle $\theta_r$ is given (Williams et al., 1984) by

$$\sin^2\theta_r = (E/E_0)\,[1 + (E_0 - E)/(2m_0c^2)]^{-1} \tag{3.132}$$

or $\theta_r \approx (E/E_0)^{1/2} = (2\theta_E)^{1/2}$ for small θ and nonrelativistic incident electrons.
This high-angle scattering corresponds to “hard” collisions with small impact
parameter, where the interaction involves mainly the electrostatic field of a single
inner-shell electron and is largely independent of the nucleus. In fact, the E-q relation
represented by Eq. (3.131) is simply that for Rutherford scattering by a free,
stationary electron; the nonzero width of the Bethe ridge reflects the effect of nuclear
binding or (equivalently) the nonzero kinetic energy of the inner-shell electron.
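
To get a feel for where the Bethe ridge lies, Eq. (3.132) can be evaluated directly. The short Python sketch below is illustrative only: the function name is hypothetical, energies are in eV, and the electron rest energy is taken as 510998.95 eV.

```python
import math

M0C2_EV = 510998.95  # electron rest energy m0*c^2 in eV

def bethe_ridge_angle(E, E0):
    """Scattering angle theta_r (rad) of the Bethe ridge, from Eq. (3.132)."""
    sin_sq = (E / E0) / (1.0 + (E0 - E) / (2.0 * M0C2_EV))
    return math.asin(math.sqrt(sin_sq))

# Example: 1000 eV loss with 100-keV incident electrons
theta_r = bethe_ridge_angle(1000.0, 100e3)
print(theta_r * 1e3, "mrad")
print(math.sqrt(1000.0 / 100e3) * 1e3, "mrad (nonrelativistic small-angle estimate)")
```

The relativistic factor in Eq. (3.132) pulls the ridge to a slightly smaller angle than the estimate $(E/E_0)^{1/2}$, by a few percent at 100 keV.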

The energy dependence of the GOS is obtained by taking cross sections through 
the Bethe surface at constant q. In particular, planes corresponding to very small
values of q (left-hand boundary of Fig. 3.36) give the inner-shell contribution
$df_k(0, E)/dE$ to the optical oscillator strength per unit energy $df(0, E)/dE$, which
is proportional to the photoabsorption cross section $\sigma_0$:

$$df(0, E)/dE = df_k(0, E)/dE + (df/dE)' = \sigma_0/C \tag{3.133}$$

where $(df/dE)'$ represents a background contribution from outer shells of lower
binding energy and $C = 1.097 \times 10^{-20}\ \mathrm{m^2\,eV}$ (Fano and Cooper, 1968).
Experimental values of photoabsorption cross section have been tabulated (Hubbell, 
1971; Veigele, 1973) and can be used to test the results of single-atom calculations 
of the GOS. 

Such a comparison is shown in Figs. 3.37 and 3.38. For K-shell ionization, a
hydrogenic calculation predicts quite well the overall shape of the absorption edge
and the absolute value of the photoabsorption cross section. In the case of L-shells,
the hydrogenic model gives too large an intensity just above the absorption threshold
(particularly for the lighter elements) and too low a value at high energies. This
discrepancy arises from the oversimplified treatment of screening in the hydrogenic
model, where the effective nuclear charge $Z_s$ is taken to be independent of
the atomic coordinate r. In reality, energy losses just above the threshold involve
interaction further from the nucleus, where the effective charge is smaller (because
of outer-shell screening), giving an oscillator strength lower than the hydrogenic
value. Conversely, energy losses much larger than $E_k$ correspond to close collisions
for which $Z_s$ approaches the full nuclear charge, resulting in an oscillator strength
slightly higher than the hydrogenic prediction. Also, for low-Z elements, the L23
absorption edge has a rounded shape (Fig. 3.38) due to the influence of the “centrifugal”
term $l'(l'+1)\hbar^2/2m_0r^2$ in the effective potential; see Eq. (3.128). To be
useful for L-shells, the hydrogenic model requires an energy-dependent correction
chosen to match the observed edge shape of each element (Egerton, 1981a). The
correction is even larger in the case of M-shells (Luo and Zeitler, 1991).

Fig. 3.37 K-shell x-ray absorption edges of several elements. Dotted lines represent the extrapolated
background and solid lines denote the photoabsorption cross section calculated using a
hydrogenic model. The experimental data points are taken from Hubbell (1971) and the dashed
lines represent Hartree-Slater calculations of McGuire (1971). From Egerton (1979), copyright
Elsevier

The Hartree-Slater model takes proper account of screening and gives a good
prediction of the edge shape in many elements (Fig. 3.38). In other cases (e.g.,
L23-edges of transition metals) the agreement is worse, owing to the fact that
the calculations usually deal only with ionizing transitions to the continuum and 
neglect excitation to discrete (bound) states just above the absorption threshold 
(Section 3.7.1). Moreover, a free-atom model cannot predict the solid-state fine 
structure which becomes prominent close to the threshold (Section 3.8). 

At large q, a constant-q section of the Bethe surface intersects the Bethe ridge
(Fig. 3.36). As a result, the energy-loss spectrum of large-angle scattering contains a
broad peak that is analogous to the Compton profile for photon scattering and whose
shape reflects the momentum distribution of the atomic electrons; see Section 5.6.7.

If df/dE is known as a function of q and E, the angular and energy dependence
of scattering can be calculated from Eq. (3.26), provided the relationship between q
and the scattering angle θ is also known.


3.6.2 Relativistic Kinematics of Scattering 


In the case of elastic scattering, conservation of momentum leads to a simple relation
between the magnitude q of the scattering vector and the scattering angle θ
(see Fig. 3.2), a given value of q corresponding to a single value of θ. In the case
of inelastic scattering, the value of q depends on both the scattering angle and the
energy loss.³ The relationship between q and θ is derived by applying the conservation
of both momentum and energy to the collision. Since a 100-keV electron
has a velocity more than half the speed of light and a relativistic mass 20% higher
than its rest mass, it is necessary to use relativistic kinematics to derive the required
relationship. At incident energies above 300 keV, further relativistic effects become
important, as discussed in Appendix A.

³This indicates an additional degree of internal freedom, which on a classical (particle) model of
scattering corresponds to the interaction between the incident and atomic electrons taking place at
different points within the electron orbit.


3.6.2.1 Conservation of Energy 


The total energy W of an incident electron (= kinetic energy $E_0$ + rest energy $m_0c^2$)
is given by the Einstein equation:

$$W = \gamma m_0 c^2 \tag{3.134}$$

where $\gamma = (1 - v^2/c^2)^{-1/2}$. The incident momentum is

$$p = \gamma m_0 v = \hbar k_0 \tag{3.135}$$

Combining Eqs. (3.134) and (3.135) gives

$$W = [(m_0c^2)^2 + p^2c^2]^{1/2} = [(m_0c^2)^2 + \hbar^2k_0^2c^2]^{1/2} \tag{3.136}$$

Conservation of energy dictates that

$$W - E = W' = [(m_0c^2)^2 + \hbar^2k_1^2c^2]^{1/2} \tag{3.137}$$

where W′ and $k_1$ are the total energy and wave number of the scattered electron, E
being the energy loss. Using Eq. (3.136) in Eq. (3.137) leads to an equation relating
the change in magnitude of the fast-electron wavevector to the energy loss⁴:

$$k_1^2 = k_0^2 - 2E[m_0^2/\hbar^4 + k_0^2/(\hbar c)^2]^{1/2} + E^2/(\hbar c)^2 = k_0^2 - 2\gamma m_0E/\hbar^2 + E^2/(\hbar c)^2 \tag{3.138}$$

Note that this relationship is independent of the scattering angle. 

For numerical calculations, it is convenient to convert each wave number to a
dimensionless quantity by multiplying by the Bohr radius $a_0$. Making use of the
equality $Ra_0^2 = \hbar^2/2m_0$, where R is the Rydberg energy, Eq. (3.138) becomes

$$(k_1a_0)^2 = (k_0a_0)^2 - (E/R)[\gamma - E/(2m_0c^2)] \tag{3.139}$$

The $E^2$ term in Eq. (3.139) is insignificant for most inelastic collisions. The value
of $(k_0a_0)^2$ is obtained from the kinetic energy $E_0$ of the incident electron:

$$(k_0a_0)^2 = (E_0/R)(1 + E_0/2m_0c^2) = (T/R)/(1 - 2T/m_0c^2) \tag{3.140}$$

where $T = m_0v^2/2$ is an “effective” incident energy, useful in the equations
that follow.

⁴For Rutherford scattering from a free electron (where $E = \hbar^2q^2/2m_0$) and low incident energies
(such that $E_0 = \hbar^2k_0^2/2m_0$), Eq. (3.138) becomes $k_1^2 = k_0^2 - q^2$, indicating that the angle between
q and $k_1$ is 90°. In this case, q goes to zero for θ → 0 (see Fig. 3.39), as implied by Eqs. (3.131)
and (3.132).
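
The quantities defined in Eqs. (3.134)-(3.140) are easily evaluated numerically. The following Python sketch is a minimal illustration: the function names are hypothetical, energies are in eV, and the constants ($m_0c^2$ = 510998.95 eV, R = 13.6 eV) are assumed values.

```python
import math

M0C2_EV = 510998.95   # electron rest energy m0*c^2 in eV
RYDBERG_EV = 13.6     # Rydberg energy R in eV, as used in the text

def kinematics(E0):
    """Return (gamma, T, (k0*a0)^2) for incident kinetic energy E0 in eV."""
    gamma = 1.0 + E0 / M0C2_EV                     # from W = gamma*m0*c^2 = E0 + m0*c^2
    v2_over_c2 = 1.0 - 1.0 / gamma**2              # from gamma = (1 - v^2/c^2)^(-1/2)
    T = 0.5 * M0C2_EV * v2_over_c2                 # T = m0*v^2/2
    k0a0_sq = (E0 / RYDBERG_EV) * (1.0 + E0 / (2.0 * M0C2_EV))   # Eq. (3.140)
    return gamma, T, k0a0_sq

def k1a0_sq(E0, E):
    """(k1*a0)^2 of the scattered electron after energy loss E, Eq. (3.139)."""
    gamma, _, k0a0_sq = kinematics(E0)
    return k0a0_sq - (E / RYDBERG_EV) * (gamma - E / (2.0 * M0C2_EV))

gamma, T, k0a0_sq = kinematics(100e3)
print(gamma, T, k0a0_sq)
print(math.sqrt(1.0 - 1.0 / gamma**2))   # v/c for a 100-keV electron
```

For 100-keV electrons this reproduces the statement made earlier in this section: v/c exceeds one half and γ is close to 1.2. The two forms of Eq. (3.140) give the same $(k_0a_0)^2$, which provides a simple consistency check.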


3.6.2.2 Conservation of Momentum 


Momentum conservation is incorporated by applying the cosine rule to the vector 
triangle (Fig. 3.39), giving 


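$$q^2 = k_0^2 + k_1^2 - 2k_0k_1\cos\theta \tag{3.141}$$

Taking a derivative of this equation gives (for constant E and $E_0$)

$$d(q^2) = 2k_0k_1\sin\theta\,d\theta = (k_0k_1/\pi)\,d\Omega \tag{3.142}$$

Substituting Eq. (3.138) into Eq. (3.141) then gives

$$(qa_0)^2 = \frac{2\gamma^2T}{R}\left[1 - \left(1 - \frac{E}{\gamma T} + \frac{E^2}{2\gamma^2Tm_0c^2}\right)^{1/2}\cos\theta\right] - \frac{\gamma E}{R} + \frac{E^2}{2Rm_0c^2} \tag{3.143}$$
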
Fig. 3.39 Vector triangle for inelastic scattering. The dashed circle represents the
locus of point Q that defines the different values of q and θ possible for a given value of
$k_1$, equivalent to a given energy loss; see Eq. (3.138). For $E \ll E_0$ and small θ,
$RP \approx SP \approx k_0\theta_E$ and $RQ \approx k_1\theta \approx k_0\theta$; applying the Pythagoras rule to the
triangle PQR then leads to Eq. (3.147)

Equation (3.143) can in principle be used to compute $qa_0$ for any value of θ, but
for small θ this procedure requires high-precision arithmetic, since evaluation of the
brackets in Eq. (3.143) involves subtracting almost identical numbers. For θ = 0,
corresponding to $q = q_{min} = k_0 - k_1$, binomial expansion of the square root in Eq.
(3.143) shows that terms up to second order in E cancel, giving

$$(qa_0)^2_{min} \approx E^2/4RT + E^3/(8\gamma^3RT^2) \tag{3.144}$$

For $\gamma^{-3}E/T \ll 1$ (which applies to practically all collisions), only the $E^2$ term is of
importance and Eq. (3.144) can be written in the form

$$q_{min} \approx k_0\theta_E \tag{3.145}$$

where $\theta_E = E/(2\gamma T) = E/(\gamma m_0v^2)$ is the characteristic inelastic scattering angle.
For a nonzero scattering angle, it is convenient to evaluate the corresponding
value of q from

$$\begin{aligned}(qa_0)^2 &= (k_0a_0 - k_1a_0)^2 + 2(k_0a_0)(k_1a_0)(1 - \cos\theta) \\ &= (qa_0)^2_{min} + 4(k_0a_0)(k_1a_0)\sin^2(\theta/2) \\ &\approx (qa_0)^2_{min} + 4\gamma^2(T/R)\sin^2(\theta/2)\end{aligned} \tag{3.146}$$

For θ ≪ 1 rad, Eq. (3.146) is equivalent to

$$q^2 \approx q^2_{min} + 4k_0^2(\theta/2)^2 \approx k_0^2(\theta^2 + \theta_E^2) \tag{3.147}$$

Equation (3.147) is valid outside the dipole region, provided sin θ ≈ θ, and is
relativistically correct provided $\theta_E$ is defined as E/pv = E/(2γT) rather than as
$E/2E_0$.
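
The numerically stable route from scattering angle to $(qa_0)^2$, Eqs. (3.144) and (3.146), and its small-angle form, Eq. (3.147), can be sketched in Python as follows. Function names and constants are assumptions made for illustration; energies are in eV and angles in radians, and $(k_0a_0)^2 = \gamma^2T/R$ is used, which follows from Eq. (3.140).

```python
import math

M0C2_EV = 510998.95   # electron rest energy m0*c^2 in eV
RYDBERG_EV = 13.6     # Rydberg energy R in eV, as used in the text

def gamma_and_T(E0):
    """Relativistic factor gamma and effective energy T = m0*v^2/2 (eV)."""
    gamma = 1.0 + E0 / M0C2_EV
    T = 0.5 * M0C2_EV * (1.0 - 1.0 / gamma**2)
    return gamma, T

def qa0_sq(E0, E, theta):
    """(q*a0)^2 from Eqs. (3.144) and (3.146); avoids the cancellation in Eq. (3.143)."""
    gamma, T = gamma_and_T(E0)
    R = RYDBERG_EV
    qa0_sq_min = E**2 / (4.0 * R * T) + E**3 / (8.0 * gamma**3 * R * T**2)
    return qa0_sq_min + 4.0 * gamma**2 * (T / R) * math.sin(theta / 2.0) ** 2

def qa0_sq_small_angle(E0, E, theta):
    """Small-angle form, Eq. (3.147), with theta_E = E/(2*gamma*T)."""
    gamma, T = gamma_and_T(E0)
    theta_E = E / (2.0 * gamma * T)
    k0a0_sq = gamma**2 * T / RYDBERG_EV
    return k0a0_sq * (theta**2 + theta_E**2)

# Carbon K-edge loss (284 eV), 100-keV electrons, 5-mrad scattering angle
print(qa0_sq(100e3, 284.0, 5e-3), qa0_sq_small_angle(100e3, 284.0, 5e-3))
```

Because every term in `qa0_sq` is positive, no subtraction of nearly equal numbers occurs, so ordinary double-precision arithmetic suffices even at θ = 0.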


3.6.3 Ionization Cross Sections 


For θ ≪ 1 rad, the energy-differential cross section can be obtained by integrating
Eq. (3.29) up to an appropriate collection angle β:

$$\frac{d\sigma}{dE} \approx \frac{4R\hbar^2}{Em_0^2v^2}\int_0^\beta \frac{df}{dE}\,2\pi\theta\,(\theta^2 + \theta_E^2)^{-1}\,d\theta \tag{3.148}$$

Within the dipole region of scattering, where $(qa_0)^2 \ll 1$ (equivalent to β < 10 mrad
at the carbon K-edge, for 100-keV incident electrons), the GOS is approximately
constant and equal to the optical value df(0, E)/dE, so Eq. (3.148) becomes

$$\frac{d\sigma}{dE} = \frac{4\pi a_0^2R^2}{ET}\,\frac{df(0, E)}{dE}\,\ln[1 + (\beta/\theta_E)^2] \tag{3.149}$$

To evaluate Eq. (3.148) outside the dipole region, df/dE is computed for each angle
θ, related to q by Eq. (3.141) or (3.147).

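Alternatively, q or $qa_0$ can be used as the variable of integration. From Eq. (3.26)
and (3.142), we have

$$\frac{d\sigma}{dE} = \frac{4\pi\gamma^2R}{Ek_0^2}\int \frac{df(q, E)}{dE}\,d[\ln(q^2)] \tag{3.150}$$

$$\frac{d\sigma}{dE} = 4\pi a_0^2\left(\frac{E}{R}\right)^{-1}\left(\frac{T}{R}\right)^{-1}\int \frac{df}{dE}\,d[\ln(qa_0)^2] \tag{3.151}$$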
where $T = m_0v^2/2$, $R = \hbar^2/(2m_0a_0^2) = 13.6$ eV, and the limits of integration are,
from Eqs. (3.144) and (3.146),

$$(qa_0)^2_{min} \approx E^2/(4RT) \tag{3.152}$$

$$(qa_0)^2_{max} = (qa_0)^2_{min} + 4\gamma^2(T/R)\sin^2(\beta/2) \tag{3.153}$$

Integration over a logarithmic grid, as implied by Eq. (3.151), is convenient for
numerical evaluation, since in the dipole region dσ/dE peaks sharply at small angles
but varies much more slowly at larger θ. Equation (3.151) reveals that the energy-differential
cross section is proportional to the area under a constant-E section
through the Bethe surface between $(qa_0)_{min}$ and $(qa_0)_{max}$.⁵
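
The logarithmic-grid integration of Eq. (3.151) can be sketched as below. This is an illustrative Python outline, not a published program: the function names, the trapezoidal rule, and the number of grid points are assumptions, and the GOS is supplied by the caller as a function `gos(qa0_sq, E)` returning df/dE in eV⁻¹.

```python
import math

M0C2_EV = 510998.95   # electron rest energy m0*c^2 in eV
RYDBERG_EV = 13.6     # Rydberg energy R in eV, as used in the text
A0_M = 5.29177e-11    # Bohr radius a0 in metres

def gamma_and_T(E0):
    """Relativistic factor gamma and effective energy T = m0*v^2/2 (eV)."""
    gamma = 1.0 + E0 / M0C2_EV
    T = 0.5 * M0C2_EV * (1.0 - 1.0 / gamma**2)
    return gamma, T

def dsigma_dE(gos, E, E0, beta, n=200):
    """Energy-differential cross section (m^2 per eV) from Eq. (3.151)."""
    gamma, T = gamma_and_T(E0)
    R = RYDBERG_EV
    q_min = E**2 / (4.0 * R * T)                                        # Eq. (3.152)
    q_max = q_min + 4.0 * gamma**2 * (T / R) * math.sin(beta / 2.0)**2  # Eq. (3.153)
    lo, hi = math.log(q_min), math.log(q_max)
    h = (hi - lo) / n
    # Trapezoidal rule on a grid that is uniform in ln[(q*a0)^2]
    total = 0.5 * (gos(q_min, E) + gos(q_max, E))
    for i in range(1, n):
        total += gos(math.exp(lo + i * h), E)
    return 4.0 * math.pi * A0_M**2 * (R / E) * (R / T) * total * h

# Check against the dipole formula, Eq. (3.149), using a constant GOS (hypothetical value)
f0 = 1.0e-3
print(dsigma_dE(lambda qa0_sq, E: f0, 284.0, 100e3, 5e-3))
```

With a constant GOS the integral reduces to $\ln[(qa_0)^2_{max}/(qa_0)^2_{min}]$, which for small β equals the factor $\ln[1 + (\beta/\theta_E)^2]$ of Eq. (3.149); this makes a convenient test before a realistic GOS is substituted.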

A computation of dσ/dE is shown in Fig. 3.40. Logarithmic axes are used in
order to illustrate the approximate behavior:

$$d\sigma/dE \propto E^{-s} \tag{3.154}$$


where s is the downward slope in Fig. 3.40 and is constant over a limited range of 
energy loss. The value of s is seen to depend on the size of the collection aperture, 
an effect that has been confirmed experimentally (Maher et al., 1979). 

For large β, such that most of the inner-shell scattering contributes to the loss
spectrum, s is typically about 3 at the ionization edge ($E = E_k$), decreasing toward
2 with increasing energy loss. This asymptotic $E^{-2}$ behavior reflects the fact that for
$E \gg E_k$ practically all the scattering lies within the Bethe ridge and approximates
to Rutherford scattering from a free electron, for which $d\sigma/dE \propto q^{-4} \propto E^{-2}$.

For small β, s increases with increasing energy loss, the largest value (just
over 6) corresponding to large E and very small β. Equation (3.148) gives
$d\sigma/dE \propto E^{-1}\theta_E^{-2}\,df(0, E)/dE \propto E^{-3}\,df(0, E)/dE$ for very small β, while
$df(0, E)/dE \propto E^{-3.5}$ for K-shell excitation and E → ∞ (Rau and Fano, 1967), so an asymptotic
$E^{-6.5}$ behavior would be expected.

Fig. 3.40 Energy-differential cross section for K-shell ionization of carbon
($E_k$ = 284 eV), calculated for different collection semi-angles β using
hydrogenic wavefunctions; $d\sigma_K/dE$ represents the K-loss intensity, after
subtracting the background to the K-ionization edge. From Egerton (1979),
copyright Elsevier
[Axes: $d\sigma_K/dE$ (cm² eV⁻¹) against energy loss E (eV), logarithmic scales]

⁵The volume under the Bethe surface is a measure of the electron stopping power.

For thin specimens in which plural scattering is negligible, the inner-shell contribution
to the energy-loss spectrum (recorded with a collection semi-angle β) is the
single-scattering intensity $J_k^1(\beta, E)$ given by

$$J_k^1(\beta, E) = NI_0\,d\sigma/dE \tag{3.155}$$

where N is the number of atoms per unit specimen area contributing to the ionization
edge and $I_0$ is the integrated zero-loss intensity.


3.6.3.1 Partial Cross Section 


For quantitative elemental analysis, the inner-shell intensity can be integrated over
an energy range of width Δ beyond an ionization edge. For a very thin specimen
(negligible plural scattering) the integrated intensity is

$$I_k^1(\beta, \Delta) = NI_0\,\sigma_k(\beta, \Delta) \tag{3.156}$$

where the “partial” cross section $\sigma_k(\beta, \Delta)$ is defined by

$$\sigma_k(\beta, \Delta) = \int_{E_k}^{E_k+\Delta} \frac{d\sigma}{dE}\,dE \tag{3.157}$$

For numerical integration of dσ/dE, use can be made of the power-law behavior,
Eq. (3.154), to reduce the required number of energy increments.
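
One way of exploiting the power-law behavior is to assume $d\sigma/dE \propto E^{-s}$ within each energy increment, with s taken from the two end points, and to integrate each increment analytically. The Python sketch below illustrates this idea; the function name and the sample values are hypothetical, and this is not claimed to be the scheme used in any particular published program.

```python
import math

def integrate_power_law(energies, values):
    """Integrate sampled dsigma/dE over energy, assuming a local power law
    y = c * E**(-s) between each pair of adjacent samples (cf. Eq. (3.154))."""
    total = 0.0
    points = list(zip(energies, values))
    for (E1, y1), (E2, y2) in zip(points, points[1:]):
        s = -math.log(y2 / y1) / math.log(E2 / E1)      # local exponent
        if abs(s - 1.0) < 1e-12:
            total += y1 * E1 * math.log(E2 / E1)        # special case s = 1
        else:
            total += (y1 * E1 - y2 * E2) / (s - 1.0)    # exact integral of c*E^-s
    return total

# Synthetic test data following E^-3 exactly, sampled at only three energies
energies = [284.0, 334.0, 384.0]
values = [E**-3 for E in energies]
print(integrate_power_law(energies, values))
print((284.0**-2 - 384.0**-2) / 2.0)    # analytic integral for comparison
```

For data that follow a power law exactly the result is independent of the number of increments, which is why coarse energy steps suffice when s varies only slowly across the integration window.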

Figure 3.41 shows the calculated angular dependence of K-shell partial cross
sections for first-row (second-period) elements. The cross sections saturate at large
values of β (i.e., above the Bethe ridge angle $\theta_r$) due to the fall in df/dE outside the
dipole region. The median scattering angle (for energy losses in the range $E_k$ to $E_k + \Delta$),
corresponding to a partial cross section equal to one half the saturation value,
is typically $5\langle\theta_E\rangle$, where $\langle\theta_E\rangle = (E_k + \Delta/2)/(2\gamma T)$. Figure 3.41 shows that whereas
the saturation values decrease with increasing incident energy, the low-angle cross
sections increase. This behavior results from the fact that the angular distribution
becomes more sharply peaked about θ = 0 as the incident energy increases, so a
small collection aperture accepts a greater fraction of the scattering.
3.6.3.2 Integral and Total Cross Sections


For a very large integration range Δ, the partial cross section becomes equivalent to
the “integral” cross section $\sigma_k(\beta)$ for inner-shell scattering into angles up to β and
all possible values of energy loss. This cross section can be evaluated by choosing
the upper limit of integration in Eq. (3.157) so that contributions from higher energy
losses are negligible. For small β, taking an upper limit equal to $3E_k$ gives less than
1% error due to higher losses, but for large β the limit must be set higher because
of contributions from the Bethe ridge.

For β less than the Bethe ridge angle $\theta_r \approx (2\theta_E)^{1/2}$, the integral cross section can
be predicted with moderate accuracy (Egerton, 1979) by using a formula analogous
to Eq. (3.149):

$$\sigma_k(\beta) \approx 4\pi a_0^2(R/T)(R/\langle E\rangle)\,f_k\,\ln[1 + (\beta/\langle\theta_E\rangle)^2] \tag{3.158}$$

where the mean energy loss ⟨E⟩ is defined by

$$\langle E\rangle = \int_0^{E_0} E\,\frac{d\sigma}{dE}\,dE \bigg/ \int_0^{E_0} \frac{d\sigma}{dE}\,dE \tag{3.159}$$

and $\langle\theta_E\rangle = \langle E\rangle/2\gamma T$. Typically $\langle E\rangle \approx 1.5E_k$ and the quantity $f_k$ in Eq. (3.158) is the
dipole oscillator strength, which for K-shell ionization is approximately 2.1 − Z/27.
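
Equation (3.158), together with the typical values quoted above for the mean energy loss and for the K-shell dipole oscillator strength, gives a quick estimate of the integral cross section. The Python sketch below is illustrative: the function names and constants are assumptions, and the result is only meaningful for β below the Bethe ridge angle.

```python
import math

M0C2_EV = 510998.95   # electron rest energy m0*c^2 in eV
RYDBERG_EV = 13.6     # Rydberg energy R in eV, as used in the text
A0_M = 5.29177e-11    # Bohr radius a0 in metres

def gamma_and_T(E0):
    """Relativistic factor gamma and effective energy T = m0*v^2/2 (eV)."""
    gamma = 1.0 + E0 / M0C2_EV
    T = 0.5 * M0C2_EV * (1.0 - 1.0 / gamma**2)
    return gamma, T

def sigma_k_integral(Z, E_k, E0, beta):
    """Approximate K-shell integral cross section (m^2), Eq. (3.158)."""
    gamma, T = gamma_and_T(E0)
    E_mean = 1.5 * E_k                    # typical mean energy loss quoted in the text
    f_k = 2.1 - Z / 27.0                  # approximate K-shell dipole oscillator strength
    theta_E = E_mean / (2.0 * gamma * T)
    return (4.0 * math.pi * A0_M**2 * (RYDBERG_EV / T) * (RYDBERG_EV / E_mean)
            * f_k * math.log(1.0 + (beta / theta_E)**2))

# Carbon K-shell (Z = 6, E_k = 284 eV), 100-keV electrons, beta = 10 mrad
print(sigma_k_integral(6, 284.0, 100e3, 10e-3), "m^2")
```

The estimate grows only logarithmically with β, reflecting the dipole character of the scattering assumed in Eq. (3.158).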

By setting β = π, the integral cross section becomes equal to the total cross
section $\sigma_k$ for inelastic scattering from shell k. An approximate expression for $\sigma_k$ is
the “Bethe asymptotic cross section” (Bethe, 1930):

$$\sigma_k \approx 4\pi a_0^2N_kb_k(R/T)(R/E_k)\ln(c_kT/E_k) \tag{3.160}$$

where $N_k$ is the number of electrons in shell k (2, 8, and 18 for K-, L-, and M-shells),
while $b_k$ ($\approx f_k/N_k$) and $c_k$ ($\approx 4E_k/\langle E\rangle$) are factors that can be obtained by
calculation or from measurements of cross section (Inokuti, 1971; Powell, 1976).
If experimental values are available for different incident energies, a plot of $T\sigma_k$
against ln T (known as a Fano plot) should yield a straight line, according to Eq.
(3.160). Linearity of the Fano plot is sometimes used as a test of the reliability of 
the measured cross sections or of the applicability of Bethe theory (for example, 
at low incident energies). The slope and intercept of the plot give the values of bx 
and cx. At incident electron energies above about 200 keV, Bethe theory must be 
modified to take account of retardation effects and a modified form of the Fano plot 
is required (Appendix A). 
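
The extraction of $b_k$ and $c_k$ from a Fano plot amounts to a straight-line fit of $T\sigma_k$ against ln T. The Python sketch below shows one way of doing this with an ordinary least-squares fit; the function names are hypothetical, and the "data" are synthetic values generated from Eq. (3.160) with arbitrarily chosen $b_k$ and $c_k$, not measurements.

```python
import math

RYDBERG_EV = 13.6     # Rydberg energy R in eV, as used in the text
A0_M = 5.29177e-11    # Bohr radius a0 in metres

def bethe_sigma(T, E_k, N_k, b_k, c_k):
    """Bethe asymptotic cross section (m^2), Eq. (3.160); T and E_k in eV."""
    return (4.0 * math.pi * A0_M**2 * N_k * b_k * (RYDBERG_EV / T)
            * (RYDBERG_EV / E_k) * math.log(c_k * T / E_k))

def fano_fit(T_values, sigma_values, E_k, N_k):
    """Fit T*sigma against ln(T) and return (b_k, c_k) implied by Eq. (3.160)."""
    x = [math.log(T) for T in T_values]
    y = [T * s for T, s in zip(T_values, sigma_values)]
    x_mean = sum(x) / len(x)
    y_mean = sum(y) / len(y)
    slope = (sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
             / sum((xi - x_mean) ** 2 for xi in x))
    intercept = y_mean - slope * x_mean
    # slope = 4*pi*a0^2*N_k*b_k*R^2/E_k and intercept = slope*ln(c_k/E_k)
    b_k = slope * E_k / (4.0 * math.pi * A0_M**2 * N_k * RYDBERG_EV**2)
    c_k = E_k * math.exp(intercept / slope)
    return b_k, c_k

# Synthetic carbon-K example with arbitrary b_k = 0.9 and c_k = 2.5
T_values = [20e3, 40e3, 60e3, 80e3]
sigmas = [bethe_sigma(T, 284.0, 2, 0.9, 2.5) for T in T_values]
print(fano_fit(T_values, sigmas, 284.0, 2))
```

With real measurements, the scatter of the points about the fitted line gives the linearity test described above.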


3.7 The Form of Inner-Shell Edges 


In this section, we consider first the overall shape of ionization edges, as deduced 
from atomic calculations (Leapman et al., 1980; Rez, 1982), photoabsorption data 
(Hubbell, 1971; Veigele, 1973), and libraries of EELS data (Zaluzec, 1981; Ahn and 
Krivanek, 1983; Ahn, 2004). We concentrate on edges within the energy range 50-2000 eV,
which are more easily observable by EELS. A table of edge shapes and
edge energies is given in Appendix D, together with a table showing the relation- 
ship between the quantum-mechanical and spectroscopy notations for inner-shell 
excitation. 


3.7.1 Basic Edge Shapes 


Because the wavefunctions of core electrons change relatively little when atoms 
aggregate to form a solid, an atomic model provides a useful indication of the gen- 
eral shape of inner-shell edges. Following Manson (1972), Leapman et al. (1980) 
calculated differential cross sections for K-, L-, and M-shell ionization on the basis 
of the Hartree—Slater central-field model (Section 3.6.1). Their results for K-shell 
edges are shown in Fig. 3.42a, where the vertical axis represents the core-loss 
intensity after background subtraction and in the absence of plural scattering and 
instrumental broadening. Although the vertical scale in Fig. 3.42 refers to an incident
energy of 80 keV and a collection semi-angle β of 3 mrad, the K-edges retain
their characteristic “sawtooth” shape for different values of $E_0$ and β; see Fig. 3.42b.

Fig. 3.42 (a) Energy-differential cross section for K-shell ionization in boron, carbon, nitrogen,
and oxygen, calculated for 80-keV incident electrons and 3-mrad collection semi-angle using the
Hartree-Slater method. (b) Comparison of Hartree-Slater and hydrogenic calculations for the carbon
K-edge, taking $E_0$ = 80 keV and collection semi-angles of 10 and 100 mrad (Leapman et al.,
1980)


As seen in Fig. 3.43, measured K-shell edges conform to this same overall shape 
but with the addition of some pronounced fine structure. The K-ionization edges 
remain basically sawtooth shaped for third-period elements (Na to Cl). Their inten- 
sities are lower, resulting in a relatively high noise content in the experimental 
data. 

Calculations of L23 edges (excitation of 2p electrons) in third-period elements
(Na to Cl) show that they have a more rounded profile, as in Fig. 3.44. The intensity
exhibits a delayed maximum 10-20 eV above the ionization threshold, resulting
from the l′(l′ + 1) term in the radial Schrödinger equation, Eq. (3.128), which causes
a maximum to appear in the effective atomic potential. At energies just above the
ionization threshold, this “centrifugal barrier” prevents overlap between the initial
(2p) and final-state wavefunctions, particularly for final states with a large angular
momentum quantum number l′. Measured L23 edges display this delayed maximum
(Ahn, 2004), although excitonic effects can sharpen the edge, particularly in the case
of insulating materials (see page 212).

Fourth-period elements give rise to quite distinctive L-edges. Atomic calculations 
predict that the L23 edges of K, Ca, and Sc will be sharply peaked at the 
ionization threshold because of a “resonance” effect: the dipole selection rule favors 
transitions to final states with d-character (l′ = 2) and the continuum 3d wavefunction 
is sufficiently compact to fit mostly within the centrifugal barrier, resulting in 
strong overlap with the core-level (2p) wavefunction and a large oscillator strength 
at threshold (Leapman et al., 1980). These sharp threshold peaks are known as 
white lines since they were first observed in x-ray absorption spectra where the 
high absorption peak at the ionization threshold resulted in almost no blackening 
on a photographic plate. The white lines are mainly absent in the calculated L23 
edge profiles of transition metal atoms (Fig. 3.44) because the calculations neglect 
excitation to bound states (discrete unoccupied 3d levels). In a solid, however, these 
atomic levels form a narrow energy band with a high density of vacant d-states, 
leading to the strong threshold peaks observed experimentally; see Fig. 3.45. 

Fig. 3.43 K-ionization edges (after background subtraction) measured by EELS with 120-keV 
incident electrons and collection semi-angles in the range of 3-15 mrad. The background intensity 
before each edge has been extrapolated and subtracted, as described in Chapter 4. From Zaluzec 
(1982), copyright Elsevier 

Fig. 3.44 Hartree-Slater calculations of L23 edges, for 80-keV incident electrons and a collection 
semi-angle of 10 mrad (Leapman et al., 1980) 

Spin-orbit splitting causes the magnitude of the L2 binding energy to be slightly 
higher than that of the L3 level. Consequently, two threshold peaks are observed, 
whose separation increases with increasing atomic number (Fig. 3.45). The ratio 
of intensities of the L3 and L2 white lines is found to deviate from the “statistical” 
value (2.0) based on the relative occupancy of the initial-state levels (Leapman 
et al., 1982). This deviation is caused by spin coupling between the core hole and 
the final state (Barth et al., 1983) and is useful for determining valence state; see 
Section 5.6.4. 
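
The “statistical” value quoted above follows from counting the 2j + 1 sublevels of each spin-orbit-split initial state. The short sketch below is illustrative only: the function name `statistical_ratio` is an assumption introduced here, and the M5/M4 value is included simply as the analogous count for 3d initial states, not as a measured ratio.

```python
from fractions import Fraction

def statistical_ratio(j_upper: Fraction, j_lower: Fraction) -> Fraction:
    """Ratio of initial-state degeneracies (2j + 1) for two spin-orbit-split levels."""
    return (2 * j_upper + 1) / (2 * j_lower + 1)

# L3 (2p3/2, four electrons) relative to L2 (2p1/2, two electrons)
l3_l2 = statistical_ratio(Fraction(3, 2), Fraction(1, 2))

# M5 (3d5/2, six electrons) relative to M4 (3d3/2, four electrons)
m5_m4 = statistical_ratio(Fraction(5, 2), Fraction(3, 2))

print(f"L3/L2 statistical ratio: {float(l3_l2):.1f}")
print(f"M5/M4 statistical ratio: {float(m5_m4):.1f}")
```

Measured white-line ratios can then be compared against these occupancy-based numbers; the deviation, not the statistical value itself, carries the chemical information.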


Fig. 3.45 L-edges of fourth-period elements measured using 120-keV electrons and a collection 
semi-angle of 5.7 mrad; the horizontal axes show energy loss (1 eV/division and 2 eV/division). 
From Zaluzec (1982), copyright Elsevier 

Fig. 3.46 Cu L23 edges in metallic copper and in cupric oxide, measured using 75-keV electrons 
scattered up to β = 2 mrad. From Leapman et al. (1982), copyright American Physical Society. 
Available at http://link.aps.org/abstract/PRB/v26/p614 

In the case of metallic copper, the d-band is full and threshold peaks are absent, 
but in compounds such as CuO electrons are drawn away from the copper atom, 
leading to empty d-levels and sharp L2 and L3 threshold peaks; see Fig. 3.46. 
Fourth-period elements of higher atomic number (Zn to Br) have full d-shells and the L23 
edges display delayed maxima, as in the case of Ge (Fig. 3.45). 

Hartree-Slater calculations of L1 edges indicate that they have a sawtooth shape 
(like K-edges) but relatively low intensity. They are usually observable as a small 
step on the falling background of the preceding L23 edge; see Fig. 3.45. 

M45 edges are prominent for fifth-period elements and appear with the intensity 
maximum delayed by 50-100 eV beyond the threshold (Fig. 3.47) because the centrifugal 
potential suppresses the optically preferred 3d → 4f transitions just above 
the threshold (Manson and Cooper, 1968). Within the sixth period, between Cs 
(Z = 55) and Yb (Z = 70), white-line peaks occur at the threshold due to a high 
density of unfilled f-states (Fig. 3.48a). The M4-M5 splitting and the M5/M4 intensity 
ratio increase with the atomic number (Brown et al., 1984; Colliex et al., 1985). 
Above Z = 71, the M4 and M5 edges occur as rounded steps (Ahn, 2004), making 
them harder to recognize. 

M23 edges of elements near the beginning of the fourth period (K to Ti) occur 
below 40 eV, superimposed on a rapidly falling valence-electron background that 
makes them appear more like plasmon peaks than typical edges. M23 edges of the 
elements V to Zn are fairly sharp and resemble K-edges (Hofer and Wilhelm, 1993); 
M1 edges are weak and are rarely observed in energy-loss spectra. 

Fig. 3.47 (a) M45 edges for E_0 = 80 keV and β = 10 mrad, according to Hartree-Slater calculations 
(Leapman et al., 1980). (b) M45 edges of fifth-period elements measured with E_0 = 120 keV 
and β = 5.7 mrad (Zaluzec, 1982) 

Fig. 3.48 (a) Energy-loss spectrum of a lanthanum oxide thin film, recorded with 200-keV electrons 
and collection semi-angle of 100 mrad, showing M5 and M4 white lines followed by M3 and 
M2 edges. (b) O45 edge of thorium recorded using 120-keV electrons and 100-mrad acceptance 
angle. Reproduced from Ahn and Krivanek (1983) 


N67, O23, and O45 edges have been recorded for some of the heavier elements 
(Ahn and Krivanek, 1983). In thorium and uranium, the O45 edges are prominent 
as a double-peak structure (spin-orbit splitting ≈ 10 eV) between 80- and 120-eV 
energy loss; see Fig. 3.48b. 


3.7.2 Dipole Selection Rule 


Particularly when a small collection aperture is employed, the transitions that 
appear prominently in energy-loss spectra are mainly those for which the dipole 
selection rule applies, as in the case of x-ray absorption spectra. From Eq. (3.145), 
the momentum exchange is approximately ħq_min ≈ ħk_0θ_E = E/v for θ < θ_E, 
whereas the momentum exchange upon absorption of a photon of energy E 
is ħq(photon) = E/c. The ratio of momentum exchange in the two cases is 
therefore 


$$ q_{\min}/q(\mathrm{photon}) \approx c/v \tag{3.161} $$


and is less than 2 for incident energies above 80 keV. Therefore the optical selection 
rule Δl = ±1 applies approximately to the energy-loss spectrum. This dipole rule 
accounts for the prominence of L23 edges (2p → 3d transitions) in transition metals 
and their compounds (high density of d-states just above the Fermi level) and of M45 
edges in the lanthanides (high density of unfilled 4f states). 
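
The claim that the ratio in Eq. (3.161) falls below 2 above about 80 keV can be checked with a few lines of relativistic kinematics. This is a minimal sketch; the function name `c_over_v` and the chosen list of incident energies are assumptions made for illustration.

```python
import math

ELECTRON_REST_ENERGY_KEV = 510.999  # m_0 c^2 in keV

def c_over_v(e0_kev: float) -> float:
    """Return c/v for an electron of kinetic energy e0_kev (relativistic)."""
    gamma = 1.0 + e0_kev / ELECTRON_REST_ENERGY_KEV
    v_over_c = math.sqrt(1.0 - 1.0 / gamma**2)
    return 1.0 / v_over_c

# Ratio q_min/q(photon) ~ c/v for a range of incident energies
for e0 in (60, 80, 100, 200, 300):
    print(f"E_0 = {e0:3d} keV   c/v = {c_over_v(e0):.2f}")
```

The ratio decreases slowly with incident energy, so raising the accelerating voltage brings the minimum momentum transfer only modestly closer to the photon value.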

In the case of a large collection aperture, the momentum transfer can be several 
times ħq_min (the median scattering angle for inner-shell excitation is typically 5θ_E; 
see Fig. 3.41) and dipole-forbidden transitions are sometimes observed (Section 
3.8.2). For example, sharp M2 and M3 peaks are seen in the spectrum of lanthanum 
oxide (Fig. 3.48a), representing Δl = 2 transitions from the 3p core level to a high 
density of unfilled 4f states. These peaks almost disappear when a small (1.6 mrad) 
collection angle is used (Ahn and Krivanek, 1983). 


3.7.3 Effect of Plural Scattering 


In discussing edge shapes, we have so far assumed that the specimen is very thin 
(t/λ < 0.3, where λ is the mean free path for all inelastic scattering) so that we 
can ignore the possibility of a transmitted electron being inelastically scattered by 
valence electrons, in addition to exciting an inner shell. In thicker samples, this 
situation no longer applies and a broad double-scattering peak appears at an energy 
loss of approximately E_k + E_p, where E_k is the edge threshold energy and E_p is the 
energy of the main “plasmon” peak observed in the low-loss region. In even thicker 
specimens, higher order satellite peaks merge with the double-scattering peak to 
produce a broad hump beyond the edge, completely transforming its shape and 
obliterating any fine structure; see Figs. 3.33 and 3.49. This behavior points to the 
advantage of using very thin specimens for the identification of ionization edges and 
the analysis of fine structure. Within limits, however, such plural or “mixed” scattering 
can be removed from the spectrum by deconvolution (Section 4.3). 
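
A rough feel for how quickly plural scattering grows with thickness can be had by assuming that the additional valence-electron losses are independent events obeying Poisson statistics. That assumption is not stated in this passage and is adopted here only for illustration, as are the function names.

```python
import math

def poisson_probability(n: int, t_over_lambda: float) -> float:
    """Probability of exactly n additional inelastic events (assumed Poisson)."""
    return t_over_lambda**n * math.exp(-t_over_lambda) / math.factorial(n)

def unshifted_fraction(t_over_lambda: float) -> float:
    """Fraction of core-loss electrons with no additional valence loss."""
    return poisson_probability(0, t_over_lambda)

def satellite_to_edge_ratio(t_over_lambda: float) -> float:
    """Weight of the E_k + E_p satellite relative to the unshifted edge."""
    return poisson_probability(1, t_over_lambda) / poisson_probability(0, t_over_lambda)

for thickness in (0.1, 0.3, 1.0, 1.5):
    print(
        f"t/lambda = {thickness:3.1f}   "
        f"unshifted = {unshifted_fraction(thickness):.2f}   "
        f"satellite/edge = {satellite_to_edge_ratio(thickness):.2f}"
    )
```

Under this assumption the satellite-to-edge weight is simply t/λ, which is consistent with treating t/λ < 0.3 as “very thin” and with the strongly distorted edges seen at t/λ ≈ 1.5.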




Fig. 3.49 (a) Energy-loss spectrum recorded from a thick (t/λ ≈ 1.5) specimen of nickel 
oxide. (b) Oxygen-K and nickel-L edges, after background removal. (c) Edge profiles after plural 
scattering was removed by Fourier ratio deconvolution (Zaluzec, 1983). Horizontal axis: energy 
loss E (eV) 


3.7.4 Chemical Shifts in Threshold Energy 


When atoms combine to form a molecule or a solid, the outer-shell wavefunctions 
change to become molecular orbitals or Bloch functions, their energy levels then 
reflecting the overall chemical or crystallographic structure. Although core-level 
wavefunctions are altered to a much smaller extent, the energy of the core level may 
change by several electron volts, depending on the chemical and crystallographic 
environment of the atom involved. 

Core-level binding energies can be measured directly by x-ray photoelectron 
spectroscopy (XPS). In this technique, a bulk specimen is illuminated with 
monochromatic x-rays and an electron spectrometer is used to measure the kinetic 
energies of photoelectrons that escape into the surrounding vacuum. The final state 
of the electron transition therefore lies in a continuum far above the vacuum level 
and is practically independent of the specimen. In the case of a compound, any 
increase in binding energy of a core level, relative to its value in the pure (solid) 
element, is called a chemical shift. For metallic core levels in oxides and most other 
compounds, the XPS chemical shift is positive because oxidation removes valence- 
electron charge from the metal atom, reducing the screening of its nuclear field and 
deepening the potential well around the nucleus. 

The ionization-edge threshold energies observed in EELS or in x-ray absorption 
spectroscopy (XAS) represent a difference in energy between the core-level initial 
state and the lowest energy final state of the excited electron. The corresponding 
chemical shifts in threshold energy are more complicated than in XPS because the 
lowest energy final state lies below the vacuum level and its energy depends on the 
valence-electron configuration. For example, going from a conducting phase (such 
as graphite) to an insulator (diamond) introduces an energy bandgap, raising the 
first-available empty state by several electron volts and increasing the ionization 
threshold energy (Fig. 1.4). On the other hand, the edge threshold in many ionic 
insulators corresponds to excitation to bound exciton states within the energy gap, 
reducing the chemical shift by an amount equal to the exciton binding energy. 

The situation is further complicated by a many-body effect known as relaxation. 
When a positively charged core hole is created by inner-shell excitation, nearby 
electron orbitals are pulled inward, reducing the magnitude of the measured binding 
energy by an amount equal to the relaxation energy. In XPS, where the excited 
electron leaves the solid, measured relaxation energies are some tens of electron 
volts. In EELS or XAS, however, a core electron that receives energy just slightly 
in excess of the threshold value remains in the vicinity of the core hole and the 
screening effect of its negative charge reduces the relaxation energy. In a metal, 
conduction electrons provide additional screening that is absent in an insulating 
compound, so while relaxation effects may be less in EELS than in XPS, differences 
in relaxation energy between a metal and its compounds can have an appreciable 
influence on the chemical shift (Leapman et al., 1982). 

Muller (1999) has pointed out that core-level shifts can be opposite in sign to 
those expected from electronegativity arguments, and in binary alloys the shift can 
be of the same sign for both elements. Measured core-level shifts in Ni-Al and 
Ni-Si alloys were found to be proportional to valence band shifts deduced from 
linear muffin-tin orbital calculations. In metals, therefore, the core-loss shift appears 
to be largely determined by changes in the valence band, rather than by charge 
transfer. The width of the valence band varies with changes in the type, number, 
and separation of neighboring atoms, the core level tracking these valence band 
shifts to within 0.1 eV. As a result, the EELS chemical shift is capable of providing 
information about the occupied electronic states of a metal. 

Measured EELS chemical shifts of metal-atom L3 edges in transition metal 
oxides are typically 1 or 2 eV and either positive or negative (Leapman et al., 1982). 
The shifts of K-absorption edges in the same compounds are all positive and in the 
range of 0.7-10.8 eV (Grunes, 1983). XAS chemical shifts, largely equivalent to 
those registered by EELS, have been studied extensively. The absorption edge of 
the metal atom in a compound is usually shifted to higher photon energy (compared 
to the metallic element) and these positive chemical shifts range up to 20 eV (for 
KMnO4) in the case of K-edges of transition metals. 

Because transition series elements can take more than one valency, there exist 
mixed-valency compounds (e.g., Fe3O4 and Mn3O4) containing differently charged 
ions of the same species. Since the chemical shift increases with increasing oxidation 
state, a double edge or multiple edges may be observed. In chromite spinel, for 
example, the L3 and the L2 white lines are each split by about 2 eV because of the 
presence of both Cr2+ and Cr3+ ions. Since these two ions occupy different sites 
(tetrahedral and octahedral) within the unit cell, the observed splitting will include 
a contribution (estimated as 0.7 eV) arising from the different site symmetry (Taftø 
and Krivanek, 1982b). 

Site-dependent chemical shifts can also occur in organic compounds in which 
carbon atoms are present at chemically dissimilar sites within a molecule. For exam- 
ple, the carbon K-edge recorded from a nucleic acid base shows a series of peaks, 
interpreted as being the result of several edges chemically shifted relative to one 
another due to the different effective charges on the carbon atoms (Isaacson, 1972b). 


3.8 Near-Edge Fine Structure (ELNES) 


Core-loss spectra recorded from solid specimens often show a pronounced fine 
structure, taking the form of peaks or oscillations in intensity within 50 eV of the 
ionization threshold. Most of this structure reflects the influence of atoms surround- 
ing the excited atom and requires a solid-state explanation. The basic principles are 
described in several review articles (Brydson, 1991; Sawatzky, 1991; Rez, 1992; Rez 
et al., 1995; Mizoguchi et al., 2010). In the sections below, we outline several approaches 
that have been used to interpret the fine structure in different types of material, 
namely 


1. the band structure approach, in which the final states of the excited core electron 
are Bloch states in an infinite solid; 

2. the multiple scattering concept, which considers backscattering of the excited 
electron within a cluster of typically a few hundred atoms; 

3. the molecular orbital picture, in which the final state involves just a few atoms; 

4. multiplet processes, in which several electrons are involved, but all within the 
same atom. 


3.8.1 Densities-of-States Interpretation 


Modulations of the single-scattering intensity J_k^1(E) can be related to the band 
structure of the solid in which scattering occurs. The theory is greatly simplified 
by making a one-electron approximation: excitation of an inner-shell electron is 
assumed to have no effect on the other atomic electrons. According to the Fermi 
golden rule of quantum mechanics (Manson, 1978), the transition rate is then 
proportional to a product of the density of final states N(E) and the square of an atomic 
transition matrix element M(E): 


$$ J_k^1(E) \propto d\sigma/dE \propto |M(E)|^2 N(E) \tag{3.162} $$


M(E) represents the overall shape of the edge, discussed in Section 3.7.1, and is 
determined by atomic physics, whereas N(E) depends on the chemical and 
crystallographic environment of the excited atom. To a first approximation, M(E) can be 
assumed to be a slowly varying function of energy loss E, so that variations in J_k^1(E) 
represent the energy dependence of the densities of states (DOS) above the Fermi 
level. However, the following qualifications apply. 

(1) Transitions occur only to a final state that is empty; like x-ray absorption 
spectroscopy, EELS yields information on the density of unoccupied states above 
the Fermi level. 

(2) Because the core-level states are highly localized, N(E) is a local density of 
states (LDOS) at the site of the excited atom (Heine, 1980). As a result, there 
can be appreciable differences in fine structure between edges representing dif- 
ferent elements in the same compound; see Fig. 3.50. Even in the case of a 
single element, the fine structure may be different at sites of different symmetry, 
as demonstrated by Taftø (1984) for Al in sillimanite. 

(3) The strength of the matrix element term is governed by the dipole selection rule: 
Δl = ±1, with Δl = +1 transitions predominating (see Sections 3.7.2 and 3.8.2). 
As a result, the observed DOS is a symmetry-projected density of states. Thus, 
modulations in K-edge intensity (1s initial state) reflect mainly the density of 2p 
final states. Similarly, modulations in the L2 and L3 intensities (2p initial states) 
are dominated by 3d final states, except where p → d transitions are hindered 
by a centrifugal barrier (for example, p → s transitions are observed close to 
the silicon L23 threshold). As a result of this selection rule, a dissimilar structure can 
be expected in the K- and L-edges of the same element in the same specimen. 

(4) N(E) is in principle a joint density of states, the energy dependence of the 
final-state density being convolved with that of the core level. The core-level width Γ_i 
is given approximately by the uncertainty relation Γ_i τ_h ≈ ħ, where the lifetime 
τ_h of the core hole is determined by the speed of the de-excitation mechanism 
(mainly Auger emission in light elements) and the value of Γ_i depends mainly 
on the threshold energy of the edge; see Fig. 3.51a. 

Fig. 3.50 Dashed curves: near-edge fine structure calculated for cubic boron nitride using 
pseudo-atomic orbital band theory. Solid curves: core-loss intensity measured by P. Fallon. From Weng 
et al. (1989), copyright American Physical Society. Available at 
http://link.aps.org/abstract/PRB/v40/p5694 

Fig. 3.51 (a) Energy broadening Γ_i of core levels, according to Krause and Oliver (1979) 
(circles) and Brown (1974) (triangles); the solid lines represent parameterized formulas 
(Egerton, 2007). (b) Final-state energy broadening as a function of energy above threshold, 
calculated from Eq. (3.163) with m = m_0. The data points represent measurements on 
aluminum by Hébert (2007) 


Further spectral broadening occurs because of the limited lifetime τ_f of the final 
state. Transition to a final-state energy ε above the ionization threshold results in an 
electron (effective mass m) moving away from the atom with speed v and kinetic 
energy ε = mv²/2. This energy is lost within a lifetime of approximately λ_i/v, the 
inelastic mean free path λ_i being limited to a few nanometers for ε < 50 eV (see 
later, Fig. 3.57). The resulting energy broadening is therefore 


$$ \Gamma_f \approx \hbar/\tau_f \approx \hbar v/\lambda_i = (\hbar/\lambda_i)(2\varepsilon/m)^{1/2} \tag{3.163} $$


Based on photoelectron spectroscopy, Seah and Dench (1979) parameterized the 
inelastic mean free path (for 1 eV < ε < 10^4 eV) in the form 


$$ \lambda_i(\mathrm{nm}) = 538\,a\,\varepsilon^{-2} + 0.41\,a^{3/2}\,\varepsilon^{1/2} \tag{3.163a} $$

where ε is in eV and a is the atomic diameter in nm; a³ = A/(602ρ), with A = atomic 
weight and ρ = specimen density in g/cm³. According to Eqs. (3.163) and (3.163a), 
Γ_f increases with excitation energy above the threshold and the observed DOS 
structure is progressively damped with increasing energy loss. For ε > 60 eV, the ε^{1/2} 
term in Eq. (3.163a) dominates and Γ_f tends to a limit (0.93 eV)a^{-3/2}, between 7 
and 9 eV for simple metals; see Fig. 3.51b. 
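
Equations (3.163) and (3.163a) can be combined numerically as in the sketch below. It assumes m = m_0 and a nonrelativistic ejected electron, and it uses aluminum (A = 26.98, ρ = 2.70 g/cm³) purely as an example; the function names and physical-constant values are choices made here, not taken from the text.

```python
import math

HBAR_C_EV_NM = 197.327   # hbar * c in eV nm
M0_C2_EV = 510998.95     # electron rest energy in eV

def atomic_diameter_nm(atomic_weight: float, density_g_cm3: float) -> float:
    """Atomic diameter a (nm) from a^3 = A / (602 rho)."""
    return (atomic_weight / (602.0 * density_g_cm3)) ** (1.0 / 3.0)

def imfp_seah_dench_nm(eps_ev: float, a_nm: float) -> float:
    """Inelastic mean free path (nm) from Eq. (3.163a)."""
    return 538.0 * a_nm / eps_ev**2 + 0.41 * a_nm**1.5 * math.sqrt(eps_ev)

def final_state_broadening_ev(eps_ev: float, a_nm: float) -> float:
    """Final-state broadening (eV) from Eq. (3.163) with m = m_0."""
    hbar_v = HBAR_C_EV_NM * math.sqrt(2.0 * eps_ev / M0_C2_EV)  # hbar * v in eV nm
    return hbar_v / imfp_seah_dench_nm(eps_ev, a_nm)

a_al = atomic_diameter_nm(26.98, 2.70)
print(f"a(Al) = {a_al:.3f} nm")
for eps in (5, 10, 20, 50, 100, 1000):
    print(f"eps = {eps:5d} eV   Gamma_f = {final_state_broadening_ev(eps, a_al):.2f} eV")
```

With these constants the high-energy prefactor evaluates to roughly 0.95 eV nm^{3/2}, close to the 0.93 eV quoted above, and the limiting broadening for aluminum comes out a little above 7 eV.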

The measured ELNES is also broadened by the instrumental energy resolution 
AE. To allow for this broadening and that due to the initial-state width, calculated 
densities of states can be convolved with a Gaussian or Lorentzian function of width 
(Γ_i² + ΔE²)^{1/2}. In the case of L23 or M45 edges, spin-orbit splitting results in two 
initial states with different energy (Fig. 3.45) and the measured ELNES therefore 
consists of two shifted DOS distributions. This effect can be removed by Fourier 
ratio deconvolution, with the low-loss region replaced by two delta functions whose 
strengths are suitably adjusted (Leapman et al., 1982). 
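
The broadening step described above can be sketched as a direct convolution of a calculated DOS with a Gaussian. In this illustration the quadrature width is interpreted as a full width at half maximum, which is an assumption, and the DOS is a made-up single spike rather than a real calculation.

```python
import math

def combined_width(gamma_i_ev: float, delta_e_ev: float) -> float:
    """Quadrature sum of initial-state width and instrumental resolution."""
    return math.hypot(gamma_i_ev, delta_e_ev)

def gaussian_broaden(energies, dos, fwhm_ev):
    """Convolve dos (sampled on a uniform energy grid) with a Gaussian of given FWHM."""
    sigma = fwhm_ev / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    broadened = []
    for e_out in energies:
        weights = [math.exp(-0.5 * ((e_out - e_in) / sigma) ** 2) for e_in in energies]
        norm = sum(weights)  # normalize the kernel so that total intensity is preserved
        broadened.append(sum(w * d for w, d in zip(weights, dos)) / norm)
    return broadened

# Hypothetical DOS: a single sharp feature on a 0.1-eV grid spanning 0 to 10 eV
energies = [0.1 * k for k in range(101)]
dos = [0.0] * len(energies)
dos[50] = 1.0

width = combined_width(0.3, 0.4)  # e.g. Gamma_i = 0.3 eV, Delta E = 0.4 eV
smoothed = gaussian_broaden(energies, dos, width)
print(f"combined width = {width:.2f} eV, peak height after broadening = {max(smoothed):.3f}")
```

A Lorentzian kernel could be substituted in the same loop; energy-dependent final-state broadening would require a width that varies with e_out.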

Band structure calculations that predict the electrical properties of a solid give 
the total densities of states and provide only approximate correlation with measured 
ELNES. For a more accurate description, the DOS must be resolved into the cor- 
rect angular momentum component at the appropriate atomic site. Pseudopotential 
methods have been adapted to this requirement and were used to calculate ELNES 
for diamond, SiC, and Be2C (Weng et al., 1989). The augmented plane wave (APW) 
method has provided realistic near-edge structures of transition metal compounds 
(Muller et al., 1982; Blaha and Schwarz, 1983). Some of the options are described 
in a review paper by Mizoguchi et al. (2010). The Wien2k program for perform- 
ing band structure calculations based on density functional theory is described by 
Hébert (2007); see also Section 4.7.2. 


3.8.1.1 Nondipole Effects 


As indicated by Eq. (3.162), energy-loss fine structure represents the density of final 
states modulated by an atomic transition matrix. If the initial state is a closed shell, 
the many-electron matrix element of Eq. (3.23) can be replaced by a single-electron 
matrix element, defined by 


$$ M(\mathbf{q}, E) = \int \psi_f^{*} \exp(i\mathbf{q}\cdot\mathbf{r})\, \psi_i \, d\tau \tag{3.164} $$


where ψ_i and ψ_f are the initial- and final-state single-electron wavefunctions and 
the integration is over all volume τ surrounding the initial state. Expanding the 
operator as 


$$ \exp(i\mathbf{q}\cdot\mathbf{r}) = 1 + i(\mathbf{q}\cdot\mathbf{r}) + \text{higher order terms} \tag{3.164a} $$

enables the integral in Eq. (3.164) to be split into components. The first of these, 
arising from the unity term in Eq. (3.164a), is zero because ψ_i and ψ_f are orthogonal 
wavefunctions. The second integral containing (q·r) is zero if ψ_i and ψ_f have the 
same symmetry about the center of the excited atom (r = 0) such that their product 
is even; q·r itself is an odd function and the two halves of the integral then cancel. 
But if ψ_i is an s-state (even symmetry) and ψ_f is a p-state (odd symmetry), the 
integral is nonzero and transitions are observed. This is the basis of the dipole 
selection rule, according to which the observed N(E) is a symmetry-projected density of 
states. 
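
The parity argument can be illustrated with a one-dimensional toy model. The three functions below are unnormalized harmonic-oscillator eigenfunctions standing in for “s-like” (even) and “p-like” (odd) states; they are not atomic wavefunctions, and all names are assumptions made for this sketch.

```python
import math

def integrate(f, lo=-8.0, hi=8.0, n=4001):
    """Trapezoidal integration of f on [lo, hi] with n sample points."""
    h = (hi - lo) / (n - 1)
    total = 0.5 * (f(lo) + f(hi))
    for k in range(1, n - 1):
        total += f(lo + k * h)
    return total * h

def psi_s(x):   # even, "s-like" initial state
    return math.exp(-x * x / 2.0)

def psi_p(x):   # odd, "p-like" final state
    return x * math.exp(-x * x / 2.0)

def psi_s2(x):  # even final state, orthogonal to psi_s
    return (2.0 * x * x - 1.0) * math.exp(-x * x / 2.0)

# Unity term of Eq. (3.164a): vanishes by orthogonality
overlap_sp = integrate(lambda x: psi_p(x) * psi_s(x))
# Dipole term (one power of x): survives only when the parities differ
dipole_sp = integrate(lambda x: psi_p(x) * x * psi_s(x))
dipole_ss = integrate(lambda x: psi_s2(x) * x * psi_s(x))
# Next term (two powers of x): allows even-to-even, dipole-forbidden transitions
second_order_ss = integrate(lambda x: psi_s2(x) * x * x * psi_s(x))

print(f"overlap even->odd      : {overlap_sp:.2e}")
print(f"dipole  even->odd      : {dipole_sp:.4f}")
print(f"dipole  even->even     : {dipole_ss:.2e}")
print(f"2nd order even->even   : {second_order_ss:.4f}")
```

The even-to-even integral vanishes at dipole order but not at the next order, which mirrors the way dipole-forbidden transitions appear once the higher order terms in the expansion are no longer negligible.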

For the dipole rule to hold, the higher order terms in Eq. (3.164a) must be 
negligible; if not, a third integral (representing dipole-forbidden transitions) will 
modify the energy dependence of the fine structure. From the above argument, the 
dipole condition is defined by the requirement q·r ≪ 1 for all r, equivalent to 
q ≪ q_d = 1/r_c, where r_c is the radius of the core state (defining the spatial region 
in which most of the transitions occur). The hydrogenic model gives r_c ≈ a_0/Z*, 
where Z* is the effective nuclear charge. 

For K-shells, Z* ≈ Z - 0.3 (see Section 3.6.1); for carbon K-shell excitation 
by 100-keV electrons, dipole conditions should therefore prevail for θ ≪ θ_d = 
Z*/(a_0 k_0) ≈ 67 mrad, a condition fulfilled for most of the transitions since the median 
angle of scattering is around 10 mrad (Fig. 3.41). In agreement with this estimate, 
atomic calculations indicate that nondipole contributions are less than 10% 
of the total for q < 45 nm⁻¹, equivalent to θ < 23 mrad for 100-keV electrons 
(Fig. 3.52a,b). A small spectrometer collection aperture (centered about the optic 
axis) can therefore ensure that nondipole effects are minimized. Saldin and Yao 
(1990) argue that dipole conditions hold only over an energy range ε_max above the 
excitation threshold, with ε_max ≈ 33 eV for Z = 3, increasing to 270 eV for Z = 
8. Dipole conditions should therefore apply to the ELNES of elements heavier than 
Li and to the EXELFS region for oxygen and heavier elements, for the incident 
energies used in transmission spectroscopy. 

For L23 edges, atomic calculations (Saldin and Ueda, 1992) give q_d a_0 ≈ Z*/9 
with Z* = Z - 4.5, so for silicon and 100-keV incident electrons θ_d ≈ 11 mrad. 
Solid-state calculations for Si (Ma et al., 1990) have suggested that nondipole effects 
are indeed small (within 5 eV of the threshold) for 12.5-mrad collection semi-angle 
(Fig. 3.52c) but are substantial for a large collection aperture, where monopole 
2p → 3p transitions make a substantial contribution (Fig. 3.52d). Monopole 
transitions have been observed at the Si-L23 edge of certain minerals and have been 
attributed to the low crystal symmetry that induces mixing of p- and d-orbitals 
(Brydson et al., 1992a). 
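
The characteristic dipole angles quoted in the last two paragraphs can be estimated from q_d and the incident wave number k_0. The sketch below uses q_d a_0 ≈ Z* for K-shells and q_d a_0 ≈ Z*/9 for L23 shells, as in the text; the function names, the `shell` argument, and the constant values are assumptions introduced for illustration.

```python
import math

HBAR_C_EV_NM = 197.327       # hbar * c in eV nm
M0_C2_EV = 510998.95         # electron rest energy in eV
BOHR_RADIUS_NM = 0.0529177   # a_0 in nm

def k0_per_nm(e0_ev: float) -> float:
    """Relativistic wave number k_0 = p / hbar of the incident electron."""
    pc = math.sqrt(e0_ev * (e0_ev + 2.0 * M0_C2_EV))
    return pc / HBAR_C_EV_NM

def dipole_angle_mrad(z_eff: float, e0_ev: float, shell: str = "K") -> float:
    """Angle theta_d = q_d / k_0 below which dipole conditions should prevail."""
    if shell == "K":
        qd_a0 = z_eff
    elif shell == "L23":
        qd_a0 = z_eff / 9.0
    else:
        raise ValueError("shell must be 'K' or 'L23'")
    return 1000.0 * (qd_a0 / BOHR_RADIUS_NM) / k0_per_nm(e0_ev)

carbon_k = dipole_angle_mrad(6 - 0.3, 100e3, "K")       # Z* = Z - 0.3
silicon_l = dipole_angle_mrad(14 - 4.5, 100e3, "L23")   # Z* = Z - 4.5
print(f"C K-edge  : theta_d ~ {carbon_k:.0f} mrad")
print(f"Si L23    : theta_d ~ {silicon_l:.0f} mrad")
```

With Z* = 5.7 this sketch gives a little over 60 mrad for carbon, slightly below the 67 mrad quoted above (which corresponds more closely to Z* ≈ 6), and about 12 mrad for silicon; both agree with the text at the level expected of an order-of-magnitude criterion.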

A high density of dipole-forbidden states just above the Fermi level may lead 
to observable monopole peaks, but mainly in spectra recorded with a displaced 
collection aperture where the momentum transfer is large (Auerhammer and Rez, 
1989). The dipole approximation appears justified for all M-edges, at incident ener- 
gies above 10 keV and with an axial collection aperture (Ueda and Saldin, 1992). 
A further discussion of nondipole effects is given by Hébert (2007) and (for the 
low-loss region) by Gloter et al. (2009). 

Fig. 3.52 Generalized oscillator strength (on a logarithmic scale) as a function of wave number 
for transitions to s, p, and d final states 5 eV above the edge threshold, calculated for (a) carbon 
K-edge (1s initial state) and (b) titanium L3 edge (2p3/2 initial state). From Rez (1989), copyright 
Elsevier. (c, d) Silicon L23 differential cross section for 100-keV incident electrons and acceptance 
angles of 12.5 and 100 mrad. Solid lines are results of LCAO calculations, not using the dipole 
approximation; dotted lines represent the contribution from 2p → 3p transitions. Dashed lines 
represent the dipole approximation. Reprinted with permission from Ma et al. (1990), copyright 
1990, American Institute of Physics 


3.8.1.2 Core Hole Effects 


After core electron excitation, an inner-shell vacancy (core hole) is left behind; see 
Fig. 3.53b. Because the core hole perturbs the final state of the transition, one- 
electron band structure theory is not an exact description of ELNES. Core hole 
effects can be included within band structure calculations by using a supercell 
method (Mizoguchi et al., 2010). Band structure calculations assume periodicity, 
whereas the core hole occurs only once, so it is necessary to use a supercell much 
larger than the unit cell of the crystal being simulated. For the Mg K-edge in MgO, 
Mizoguchi et al. (2010) found that a 54-atom supercell was necessary to avoid 
artifacts; see Fig. 3.53f. 

A two-particle approximation is to generalize the concept of density of states 
N(E) to include temporary bound states formed by interaction between the excited 
core electron and the core hole: core excitons. Since their effective radius may be 
larger or smaller than atomic dimensions, they are analogous to the Wannier or 
Frenkel excitons observed in the low-loss region of the spectrum (Section 3.3.3). 
The exciton energies can be estimated by use of the Z+1 or optical-alchemy 
approximation (Hjalmarson et al., 1980; Elsässer and Köstlmeier, 2001) in which 
the potential used in DOS or MS calculations is that of the next-highest atom in 
the periodic table. This is equivalent to assuming that the core hole increases (by 
one unit) the effective charge seen by outer electrons. Although not equivalent to a 
multi-particle calculation, the Z+1 approximation often gives an improvement over 
a ground-state calculation; see Fig. 3.53e-g. 

Fig. 3.53 (a)-(d) Square of the wavefunction for an electron at the bottom of the MgO conduction 
band, in the ground state, in an excited state with a core hole at the Mg 1s site, and in the Z+1 
approximation (Mg replaced by an Al atom). (e)-(g) Mg K-edge ELNES calculated using the 
all-electron OLCAO method, without and with a core hole, and using the Z+1 procedure. From 
Mizoguchi et al. (2010), copyright Elsevier 

In an insulator, core-exciton levels lie within the energy gap between valence 
and conduction bands and may give rise to one or more peaks below the threshold 
for ionization to extended states (Pantelides, 1975). Such peaks can be identified 
as excitonic if band structure calculations are available on an absolute energy scale 
and if the energy-loss axis has been accurately calibrated (Grunes et al., 1982). Even 
when peaks are not visible, excitonic effects may sharpen the ionization threshold, 
modify the fine structure, or shift the threshold to lower energy loss, as proposed for 
graphite (Mele and Ritsko, 1979) and boron nitride (Leapman et al., 1983). These 
effects may be somewhat different in EELS, as compared to x-ray absorption spec- 
troscopy, if the excited atom relaxes before the fast electron exits the exciton radius 
(Batson and Bruley, 1991; Batson, 1993b). 

In ionic compounds, the exciton is more strongly bound at the cation than at the 
anion site (Pantelides, 1975; Hjalmarson, 1980), introducing further differences in 
near-edge structure at the respective ionization edges. In the case of a metal, the 
effect of the core hole is screened within a short distance (~0.1 nm), but electron-hole 
interaction may still modify the shape of an ionization edge: if the initial state 
is s-like, the edge may become more rounded; if p-like, it may be sharpened slightly 
(Mahan, 1975). 

To properly describe the white line features present in L23 edges of transition 
metals and M45 edges of the lanthanides, a relativistic many-particle calculation is 
required, including electron-electron and electron-hole interactions (Ikeno et al., 
2006). A more empirical procedure is based on fitting parameters (de Groot and 
Kotani, 2008). 


3.8.2 Multiple-Scattering Interpretation 


An alternative approach to understanding ELNES makes use of concepts first devel- 
oped to explain x-ray absorption near-edge structure (XANES, also referred to as 
NEXAFS). This is an extension of EXAFS theory, taking into account multiple 
(plural) elastic scattering of the ejected core electron. Multiple scattering is impor- 
tant in the near-edge region, where backscattering occurs in a larger volume of the 
specimen as a result of the long inelastic mean free path of the low-energy ejected 
electron (page 219). Even so, the results reflect the local environment of the excited 
atom. This environment is divided into concentric shells surrounding the excited 
atom and backscattering from these shells is calculated sequentially. Documented 
programs for performing such calculations are available (Durham et al., 1982; 
Vvedensky et al., 1986; Ankudinov et al., 1998). Usually a moderate number of 
coordination shells is sufficient to achieve convergence; see Fig. 3.54. The backscattering
itself occurs within a diameter of several nanometers, but electron waves scattered from
more distant shells have almost random phase and their contributions tend to cancel, so the
ELNES modulations represent information mostly from within 1 nm of the excited atom
(Wang et al., 2008a).

In the case of MgO, the Mg cations scatter weakly and do not contribute appreciably
to the fine structure. The peak labeled C in Fig. 3.54b arises from single
scattering from oxygen nearest neighbors and therefore appears when only two
shells are used in the calculations. Peak B represents single scattering from
second-nearest oxygen atoms and emerges when four shells are included. Peak A is believed
to arise from plural scattering among oxygen nearest neighbors (Rez et al., 1995).
Comparison of the L-edges of MgO and Mg(OH)2 has shown that the differences
are mainly due to atoms in the second and third shells, showing that medium-range
structure can be important and amenable to ELNES analysis (Jiang et al., 2008).
In the transition metal oxides, backscattering from nearest-neighbor oxygen atoms
gives rise to a prominent peak about 40 eV from the metal-L23 edge, providing a
convenient test for oxidation (Wang et al., 2006).

Fig. 3.54 Multiple scattering simulation of the fine structure of (a) magnesium K-edge and
(b) oxygen K-edge of MgO, shown as a function of the number of shells used in the calculation.
Spectra at the top represent EELS measurements with background subtracted and plural
scattering removed by deconvolution. From Lindner et al. (1986), copyright American Physical Society.
Available at http://link.aps.org/abstract/PRB/v33/p22

An effect found to be important for higher order shells is focusing of the ejected- 
electron wave by intermediate shells, in situations where atoms are radially aligned 
(Lee and Pendry, 1975). A related effect arises from the centrifugal barrier created 
by first-neighbor atoms, which acts on high angular momentum components of the 
emitted wave, confining it locally for energies just above the edge threshold. This 
shape-resonance effect has been used to interpret absorption spectra of diatomic 
gases (Dehmer and Dill, 1977) and transition metal complexes (Kutzler et al., 1980). 
The resonance value of the ejected-electron wave number k obeys the relationship
kR = constant, where R is the bond length (Bianconi, 1983; Bianconi et al., 1983a),
resulting in the resonance energy (above threshold) being proportional to $1/R^2$. A
similar behavior is apparent from band structure calculations of transition metal
elements (Muller et al., 1982): for metals having the same crystal structure, the
energies of DOS peaks are proportional to $1/a^2$, where a is the lattice constant.
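
The $1/R^2$ scaling follows in one step from the free-electron dispersion of the ejected electron. The short derivation below is only a sketch: it assumes a nonrelativistic electron of rest mass $m_0$, writes the unspecified constant as $C$ (a symbol introduced here for illustration), and denotes the edge threshold by $E_k$.

```latex
kR = C \;\Rightarrow\; k = \frac{C}{R},
\qquad
E - E_k = \frac{\hbar^2 k^2}{2 m_0}
        = \frac{\hbar^2 C^2}{2 m_0}\,\frac{1}{R^2}
        \;\propto\; \frac{1}{R^2}
```

On this basis a 5% contraction of the bond length would raise the resonance energy above threshold by about 11%, since $1/0.95^2 \approx 1.108$.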

Although formally equivalent to a densities-of-states interpretation of ELNES 
(Colliex et al., 1985), multiple scattering (MS) calculations are performed in real 
space. It is therefore possible to treat disordered systems or complicated molecules 
such as hemoglobin (Durham, 1983) and calcium-containing proteins (Bianconi 
et al., 1983b) for which band structure calculations would not be feasible. 


3.8.3 Molecular-Orbital Theory 


In many covalent materials, a useful explanation of ELNES is in terms of molecular 
orbital (MO) theory (Glen and Dodd, 1968): the local band structure is approxi- 
mated as a linear combination of atomic orbitals (LCAO) of the excited atom and 
its immediate neighbors. A simple example is graphite, in which the four valence
electrons of each carbon atom are sp² hybridized, resulting in three strong σ bonds
to nearest neighbors within each atomic layer; the remaining p-electron contributes
to a delocalized π orbital. The corresponding antibonding orbitals are denoted as σ*
and π*; they are the empty states into which core electrons can be excited, giving
rise to distinct peaks in the K-edge spectrum (see Fig. 5.37).

In organic compounds, the presence of delocalized or unsaturated bonding again 
gives rise to sharp π* peaks at an edge threshold. For molecules containing carbon
atoms with different effective charge, such as the nucleic acid bases, several
peaks are observable and have been interpreted in terms of chemical shifts (Isaacson, 
1972a, b). Molecular-orbital concepts have been useful in the interpretation of the 
fine structure of edges recorded from minerals (Krishnan, 1990; McComb et al., 
1992) and provide at least a qualitative understanding of ionization edges recorded 
from TiO2 (Radtke and Botton, 2011). Computer programs for MO calculations are
freely available (footnote 8).


3.8.4 Multiplet and Crystal-Field Effects 


The core hole created by inner-shell ionization has an angular momentum that can 
couple with the net angular momentum of any partially filled shells within the 
excited atom. Such coupling is strongest when the hole is created within the par- 
tially filled shell itself, as in the case of N45 ionization of sixth-period elements 
from Cs (Z = 55) to Tm (Z = 69). Both spin and orbital momentum are involved, 
leading to an elaborate fine structure (Sugar, 1972). A similar effect is observed in 
the M-shell excitation of third-period elements (Davis and Feldkamp, 1976). Where 
the core hole is created in a complete shell, separate peaks may not be resolved but 
the coupling can lead to additional broadening of the fine structure. 

In lanthanide compounds, the 4f states are screened by 5s and 5p electrons and the
M45 multiplet structure is largely an atomic effect. In transition metal compounds,
however, the 3d states of the metal are sensitive to the chemical environment (e.g.,
the electrostatic field of the surrounding ligand anions such as oxygen). The $d_{xy}$,
$d_{xz}$, and $d_{yz}$ orbitals point at 45° to the crystal (x,y,z) axes whereas the two other
d-orbitals lie along the axes; therefore, the degenerate d-states split into $t_{2g}$ and
$e_g$ levels separated by a crystal-field splitting parameter Δ that depends on the
coordination (point-group symmetry) of the anions; see Fig. 3.55a. This effect is
observable as a splitting of both the L3 and the L2 white-line peaks in transition
metal oxides; see Fig. 3.55b.

Footnote 8: http://www.chem.ucalgary.ca/SHMO/, http://www.unb.ca/fredericton/science/chem/ajit/hmo.htm

Fig. 3.55 (a) Crystal-field splitting of transition metal d-states, dependent on the symmetry of
the surrounding anions. (b) Multiplet structure of the L3 and L2 white lines of CoO and NiO, as
measured by EELS and as calculated by Yamaguchi et al. (1982). The vertical bars represent the
relative intensities of the calculated states, from which the smooth curves were derived by
convolution with an instrumental resolution function. From Krivanek and Paterson (1990), copyright
Elsevier

Similar crystal-field splittings are observed by photoelectron spectroscopy 
(Novakov and Hollander, 1968) and by x-ray absorption spectroscopy. The multiplet 
structure can be calculated for different values of the crystal-field parameter Δ (de 
Groot et al., 1990), so comparison with experiment can provide information about 
the character of the bonding in minerals (Garvie et al., 1994). The availability of 
monochromated TEMs makes multiplet splitting more easily observable by EELS, 
at least for edges where the initial-state (lifetime) broadening is not excessive (Lazar 
et al., 2003; Kothleitner and Hofer, 2003). 


3.9 Extended Energy-Loss Fine Structure (EXELFS) 


Although the ionization-edge fine structure decreases in amplitude with increas- 
ing energy loss, oscillations of intensity are detectable over a range of several 
hundred electron volts if no other ionization edges follow within this region. This 
extended fine structure was first observed in x-ray absorption spectra (as EXAFS) 
and interpreted as a densities-of-states effect, involving diffraction of the ejected 
core electron due to the long-range order of the solid. However, quite strong EXAFS 
modulations are obtained from amorphous samples and the effect is now recog- 
nized to be a measure of the short-range order, involving mainly scattering from 
nearest-neighbor atoms. 

If released with a kinetic energy of 50 eV or more, the ejected core electron
behaves much like a free electron, the densities of states N(E) in Eq. (3.162)
approximating a smooth function proportional to $(E - E_k)^{1/2}$ (Stern, 1974). Nevertheless,
weak oscillations in $J_k^1(E)$ can arise from interference between the outgoing spherical
wave (representing the ejected electron) and reflected waves that arise from
elastic backscattering of the electron from neighboring atoms; see Fig. 3.56. This
interference perturbs the final-state wavefunction in the core region of the central
atom and therefore modulates N(E). The interference can be constructive or destructive,
depending on the return path length $2r_j$ (where $r_j$ is the radial distance to the jth
shell of backscattering atoms) and the wavelength λ of the ejected electron. Since
the velocity of the ejected electron is low compared to the speed of light, the wave
number k of the ejected electron is given by classical mechanics:


$$k = 2\pi/\lambda \approx [2m_0(E - E_k)]^{1/2}/\hbar \tag{3.165}$$


where E is the energy transfer (from an incident electron or x-ray photon) and $E_k$ is
the threshold energy of the edge. With increasing energy loss E, the interference is
therefore alternately constructive and destructive, giving maxima and minima in the
intensity of the scattered primary electrons.
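
To give a feel for the magnitudes in Eq. (3.165), the following minimal sketch converts an energy above threshold into the wave number and wavelength of the ejected electron. The function names and the choice of nm-based units are assumptions made for illustration; the physical constants are CODATA values.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M0 = 9.1093837015e-31    # electron rest mass, kg
EV = 1.602176634e-19     # joules per electron volt


def ejected_wavenumber(energy_loss_ev, threshold_ev):
    """Wave number k (nm^-1) of the ejected electron, Eq. (3.165)."""
    kinetic_ev = energy_loss_ev - threshold_ev
    if kinetic_ev < 0:
        raise ValueError("energy loss is below the edge threshold")
    k_per_metre = math.sqrt(2.0 * M0 * kinetic_ev * EV) / HBAR
    return k_per_metre * 1e-9


def ejected_wavelength(energy_loss_ev, threshold_ev):
    """de Broglie wavelength (nm) of the ejected electron, 2*pi/k."""
    return 2.0 * math.pi / ejected_wavenumber(energy_loss_ev, threshold_ev)


if __name__ == "__main__":
    threshold = 284.0  # illustrative threshold energy in eV (an assumption)
    for excess in (50.0, 100.0, 300.0):
        k = ejected_wavenumber(threshold + excess, threshold)
        print(f"E - Ek = {excess:5.0f} eV: k = {k:6.2f} nm^-1, "
              f"wavelength = {2.0 * math.pi / k:.4f} nm")
```

At 100 eV above threshold the wavelength is close to 0.12 nm, comparable to interatomic spacings, which is why the interference condition changes repeatedly over a few hundred eV of energy loss.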


Fig. 3.56 Pictorial representation of the electron interference that gives rise to
oscillations of fine structure in a core-loss or x-ray absorption spectrum. Wavefronts
of the outgoing wave, representing a core electron ejected from the central atom C,
are shown as dashed circles. The solid arcs depict waves backscattered from
nearest-neighbor atoms

We assume that instrumental resolution is not a limiting factor and that the
effects of plural inelastic scattering of the transmitted electron are negligible or have
been removed by deconvolution to yield the single-scattering distribution $J_k^1(E)$, as
described in Chapter 4. The oscillatory part of the intensity (following the ionization
edge) is then represented in a normalized form as


$$\chi(E) = [J_k^1(E) - A(E)]/A(E) \tag{3.166}$$


where A(E) is the energy-loss intensity that would be observed in the absence of
backscattering, as calculated using a single-atom model. Using Eq. (3.165), the
oscillatory component can also be written as a function χ(k) of the ejected-electron
wavevector. As in x-ray absorption, the main contribution to $J_k^1(E)$ is from dipole
scattering (Section 3.7.2), so the same theory can be used to interpret extended
energy-loss fine structure (EXELFS).

Approximating the ejected-electron wavefunction at the backscattering atom by 
a plane wave and assuming that multiple backscattering can be neglected, EXAFS 
theory gives (Sayers et al., 1971) 


$$\chi(k) = \sum_j \frac{N_j}{r_j^2}\,\frac{f_j(k)}{k}\,\exp(-2r_j/\lambda_i)\,\exp(-2\sigma_j^2 k^2)\,\sin[2kr_j + \phi_j(k)] \tag{3.167}$$


The summation in Eq. (3.167) is over successive shells of neighboring atoms, the
radius of shell j being $r_j$. The largest contribution comes from the nearest neighbors
(j = 1), unless these have a low scattering power (e.g., hydrogen). $N_j$ is the number
of atoms in shell j and the r-dependence of $N_j$ or $N_j/r_j^2$ is the radial distribution
function (RDF) of the atoms surrounding the ionized atom. In the case of a perfect
single crystal, the RDF should consist of a series of delta functions corresponding
to discrete values of shell radius.

In Eq. (3.167), $f_j(k)$ is the backscattering amplitude or form factor for elastic
scattering through an angle of π rad; it has units of length and can be calculated
(as a function of k), knowing the atomic number Z of the backscattering element.
The results of such calculations were tabulated by Teo and Lee (1979). For lower
Z elements, screening of the nuclear field can be neglected, so the backscattering
approximates to Rutherford scattering for which $f(k) \propto k^{-2}$, as shown by Eqs. (3.1)
and (3.3).

The damping term $\exp(-2r_j/\lambda_i)$ in Eq. (3.167) represents inelastic scattering of
the ejected electron along its outward and return path, which changes the value
of k and thereby weakens the interference, so this inelastic scattering is sometimes
referred to as absorption. Instead of incorporating a damping term explicitly
in Eq. (3.167), absorption can be included by making k into a complex quantity
whose imaginary part represents the inelastic scattering (Lee and Pendry, 1975).
Absorption arises from both electron-electron and electron-phonon collisions and
in reality the inelastic mean free path is a function of k (Fig. 3.57). Because the
mean free path is of the order of 1 nm for an electron energy of the order of 100 eV
(see Fig. 3.57), inelastic scattering provides one limit to the range of shell radii that
can contribute to the RDF. Another limit arises from the lifetime $\tau_h$ of the core
hole.

The Gaussian term $\exp(-2\sigma_j^2 k^2)$ in Eq. (3.167) is the Fourier transform of a
radial broadening function that represents broadening of the RDF due to thermal,
zero-point, and static disorder. The disorder parameter $\sigma_j$ differs from the
Debye-Waller parameter $u_j$ used in diffraction theory (see Section 3.1.5) because only the
radial component of relative motion between the central (ionized) and backscattering
atom is of concern. In a single crystal, where atomic motion is highly correlated,
$\sigma_j$ may be considerably less than $u_j$. The value of $\sigma_j$ depends on the atomic number
of the backscattering atom and on the type of bonding; in an anisotropic material
such as graphite, it will also depend on the direction of the vector $\mathbf{r}_j$.

The last term in Eq. (3.167), $\sin[2kr_j + \phi_j(k)]$, determines the interference
condition. The phase difference between the outgoing and reflected waves consists of
a path-length component $2\pi(2r_j/\lambda) = 2kr_j$ and a component $\phi_j(k)$ that accounts
for the phase change of the electron wave after traveling through the atomic field.
This phase change can in turn be split into two components, $\phi_a(k)$ and $\phi_b(k)$, that
arise from the emitting and backscattering atoms. These components can be calculated
using atomic wavefunctions, incorporating an effective potential to account
for exchange and correlation (Teo and Lee, 1979). In accordance with the dipole
selection rule, the emitted wave is expected to have p-symmetry in the case of
K-shell ionization and mainly d-symmetry in the case of an L23 edge. The phase-shift
component $\phi_a$ differs in these two cases.
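
The way the four factors of Eq. (3.167) combine can be seen in a minimal sketch such as the one below. It is not a substitute for tabulated amplitudes and phase shifts: the dictionary layout, the constant backscattering amplitude, the zero phase shift, and the shell parameters are all hypothetical values chosen for illustration, and the mean free path is taken as independent of k.

```python
import math


def exafs_chi(k, shells, mean_free_path):
    """Single-scattering, plane-wave EXAFS sum of Eq. (3.167).

    k is in nm^-1; every length (r, sigma, f, mean_free_path) is in nm.
    Each shell is a dict with keys N, r, sigma, f (callable), phi (callable).
    """
    total = 0.0
    for shell in shells:
        amplitude = shell["N"] * shell["f"](k) / (k * shell["r"] ** 2)
        damping = math.exp(-2.0 * shell["r"] / mean_free_path)    # inelastic loss
        disorder = math.exp(-2.0 * shell["sigma"] ** 2 * k ** 2)  # Gaussian term
        phase = 2.0 * k * shell["r"] + shell["phi"](k)            # interference
        total += amplitude * damping * disorder * math.sin(phase)
    return total


# Hypothetical nearest-neighbour shell; the numbers are illustrative only.
example_shell = {
    "N": 6,
    "r": 0.21,
    "sigma": 0.007,
    "f": lambda k: 0.05,   # constant amplitude in nm (an assumption)
    "phi": lambda k: 0.0,  # phase shift ignored (an assumption)
}

if __name__ == "__main__":
    for k in (30.0, 50.0, 70.0, 90.0):
        chi = exafs_chi(k, [example_shell], mean_free_path=1.0)
        print(f"k = {k:5.1f} nm^-1  chi = {chi:+.4f}")
```

With realistic inputs, f and phi would be replaced by interpolations of calculated or tabulated functions, and further shells would simply be appended to the list.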

Through Fourier transform and curve-fitting techniques (see Section 4.6),
Eq. (3.167) has enabled extended x-ray fine structure (EXAFS) to provide local
structural information from many materials. It represents a single-scattering,
plane-wave approximation but these limitations can be removed by generalizing the
backscattering amplitude $f_j(k)$ into an effective scattering amplitude, as described by
Rehr and Albers (2000) and as embodied in the FEFF computer code (Ankudinov
et al., 1998). This program calculates the inelastic scattering based on electron
self-energy concepts rather than using an empirical mean free path and also allows
for core-hole effects (including initial-state broadening), nondipole effects, thermal
vibrations, and crystalline anisotropy. Used on x-ray data, FEFF can give interatomic
separations to within 2 pm and coordination numbers to within ±1 (Rehr
and Albers, 2000). As it incorporates multiple scattering, it can also be used to analyze
near-edge (XANES or NEXAFS) x-ray structure and has since been adapted to
energy-loss (EXELFS and ELNES) measurements; see Section 4.7.


3.10 Core Excitation in Anisotropic Materials 


The properties of an anisotropic crystal vary with direction and it is possible to
measure this directionality by core-loss EELS, through appropriate choice of scattering
angle and specimen orientation. In the case of K-shell ionization, the p-type
outgoing wave representing the ejected electron probes the atomic environment
predominantly in the direction of the scattering vector q of the fast electron; see
Fig. 3.58. In other words, the contribution to the EXELFS modulations from atoms
that lie in the direction $\mathbf{r}_j$ (making an angle φ with q) is given to a first approximation
(Leapman et al., 1981) by


$$\chi(k) \propto (\mathbf{q}\cdot\mathbf{r}_j)^2 \propto \cos^2\phi \tag{3.168}$$


Fig. 3.58 Angular dependence of the ejected-electron intensity per unit solid angle, for carbon
K-shell ionization by 100-keV electrons, for an energy loss of 385 eV and various scattering angles
(panels labeled θ = 1 mrad and θ = 5 mrad). The dashed line represents the direction of the
scattering vector q of the fast electron, whose path is shown by the dotted line. From Maslen and
Rossouw (1983), copyright Taylor and Francis



For very small scattering angles ($\theta \ll \theta_E$), atoms lying along the direction of
the incident wavevector $\mathbf{k}_0$ make the major contribution to χ(k), whereas at larger
scattering angles atoms lying perpendicular to $\mathbf{k}_0$ contribute the most; see Fig. 3.58.
However, the scattered intensity in the second case is much less. To ensure equal
intensities in both spectra, a small collection aperture can be used to record the
core-loss spectrum at scattering angles of +θ and −θ relative to the optic axis. Choosing
θ to be equal to the characteristic angle ($\theta_E = E/\gamma m_0 v^2$) results in the angle φ
between q and $\mathbf{k}_0$ being ±45°, since $\phi \approx \tan^{-1}(\theta/\theta_E)$ from the geometry of the triangle
PQR in Fig. 3.39 (points S and R come together for small θ).
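
As a numerical illustration of this geometry, the sketch below evaluates the characteristic angle and the direction of q for a given scattering angle. The function names are hypothetical, and the 285 eV energy loss and 100 keV incident energy in the example are assumed values rather than ones taken from the text.

```python
import math

M0C2_EV = 510998.95  # electron rest energy, eV


def characteristic_angle(energy_loss_ev, e0_ev):
    """theta_E = E / (gamma * m0 * v^2), in radians."""
    gamma = 1.0 + e0_ev / M0C2_EV
    beta_sq = 1.0 - 1.0 / gamma ** 2
    return energy_loss_ev / (gamma * M0C2_EV * beta_sq)


def q_direction_deg(theta, theta_e):
    """Angle phi (degrees) between q and k0 for scattering angle theta."""
    return math.degrees(math.atan2(theta, theta_e))


if __name__ == "__main__":
    theta_e = characteristic_angle(285.0, 100e3)
    print(f"theta_E = {theta_e * 1e3:.3f} mrad")
    for factor in (-1.0, 0.1, 1.0, 5.0):
        phi = q_direction_deg(factor * theta_e, theta_e)
        print(f"theta = {factor:+.1f} theta_E -> phi = {phi:+.1f} deg")
```

For these assumed conditions the characteristic angle is about 1.55 mrad, so the collection aperture has to be displaced by only a milliradian or two to select the ±45° directions.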

If the specimen is tilted 45° relative to the incident beam, these ±θ spectra will 
provide information weighted toward directions parallel and perpendicular to the 
specimen plane. In the case of a hexagonal layer material such as graphite, this 
means information about the bonding within and perpendicular to the basal-plane 
layers. Orientation-dependent EXELFS spectra were first obtained from test spec- 
imens of graphite (Disko, 1981) and boron nitride (Leapman et al., 1981). An 
orientation dependence observed in the near-edge fine structure of boron nitride 
(Leapman and Silcox, 1979) was interpreted in terms of the directionality of 
chemical bonding. 

Nonrelativistic expressions (Leapman et al., 1983) for the angular distribution of
the σ and π intensities, for a uniaxial specimen rotated through an angle Γ about
the y-axis (see Fig. 3.60a), are


$$J(\pi) \propto \cos^2(\phi - \Gamma)/(\theta^2 + \theta_E^2) \tag{3.169}$$
$$J(\sigma) \propto \sin^2(\phi - \Gamma)/(\theta^2 + \theta_E^2) \tag{3.170}$$


For Γ = 0, $J(\pi) \propto \theta_E^2/(\theta^2 + \theta_E^2)^2$, so the π-intensity has a narrow forward-peaked
angular distribution, falling by a factor of 4 between θ = 0 and θ = $\theta_E$. On the other
hand, $J(\sigma) \propto \theta^2/(\theta^2 + \theta_E^2)^2$ for Γ = 0, so the σ intensity is zero at θ = 0 and
rises to a maximum at $\theta \approx \pm\theta_E$. The sum of these two components is Lorentzian.
Tilting the specimen makes the two angular distributions asymmetric, qualitatively
similar to the surface plasmon case (Fig. 3.22), and allows the two components to
be measured separately. Tilting to 45° is advantageous because it maximizes the
separation between the two angular distributions and makes their relative intensities
similar; see Fig. 3.59a.
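
The properties quoted for Eqs. (3.169) and (3.170) are easy to check numerically. In the sketch below, angles are expressed in units of the characteristic angle, φ is taken as tan⁻¹(θ/θ_E) as in the preceding discussion, and the sign convention for the tilt is an assumption; the function names are likewise hypothetical.

```python
import math


def q_direction(theta, theta_e):
    """Angle phi (radians) between q and the incident wavevector."""
    return math.atan2(theta, theta_e)


def j_pi(theta, theta_e, tilt=0.0):
    """Relative pi intensity, Eq. (3.169); tilt in radians."""
    phi = q_direction(theta, theta_e)
    return math.cos(phi - tilt) ** 2 / (theta ** 2 + theta_e ** 2)


def j_sigma(theta, theta_e, tilt=0.0):
    """Relative sigma intensity, Eq. (3.170); tilt in radians."""
    phi = q_direction(theta, theta_e)
    return math.sin(phi - tilt) ** 2 / (theta ** 2 + theta_e ** 2)


if __name__ == "__main__":
    theta_e = 1.0  # work in units of theta_E
    print("pi(0)/pi(theta_E) =", j_pi(0.0, theta_e) / j_pi(theta_e, theta_e))
    grid = [i * 0.01 for i in range(501)]
    peak = max(grid, key=lambda t: j_sigma(t, theta_e))
    print("sigma maximum at theta/theta_E =", peak)
    for t in (-1.0, 1.0):
        print(f"45 deg tilt, theta = {t:+.0f} theta_E:",
              f"pi = {j_pi(t, theta_e, math.pi / 4):.3f},",
              f"sigma = {j_sigma(t, theta_e, math.pi / 4):.3f}")
```

With a 45° tilt, one of the two ±θ_E directions carries essentially pure π intensity and the other pure σ intensity, which is the separation exploited in the ±θ measurement described above.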

For a uniaxial crystal, Klie et al. (2003) have integrated the z-axis and basal-plane
(xy) intensities over all azimuthal angles and up to a scattering angle β. For
an electron beam parallel to the c-axis, these integrated intensities are equal when
β = $\theta_E$ (solid curves in Fig. 3.59b).

Contour plots of the K-loss σ and π intensities, in both x- and y-directions in the
diffraction plane, are shown in Fig. 3.60b for specimen-tilt angles up to 45°. Again,
tilting is seen to lead to anisotropy in the scattering pattern, the σ-intensity developing
a “bean” shape, elongated parallel to the axis of tilt. These results illustrate how
energy-filtered diffraction might be used to probe the bonding in small volumes of
anisotropic materials (Browning et al., 1991b; Klie et al., 2003; Botton, 2005; Saitoh
et al., 2006).

Fig. 3.59 (a) Calculated and measured angular distributions of π and σ intensities in the K-loss
spectrum of graphite, within the plane of tilt and for specimen oriented with a tilt angle γ = −45°
(Leapman et al., 1983). Available at http://link.aps.org/abstract/PRB/v28/p2361. (b) Axial (z) and
basal-plane (xy) intensities as a function of collection angle (Klie et al., 2003). Reprinted with
permission, copyright American Physical Society. Available at http://link.aps.org/abstract/PRB/
v67/p144508

Fig. 3.60 (a) Specimen geometry in relation to the recording plane of an energy-filtered diffraction
pattern. (b) Intensity in the diffraction plane, calculated from Eqs. (3.169) and (3.170) for 1s → π*
and 1s → σ* transitions in a hexagonal crystal, for three values of the y-axis specimen tilt Γ. The
optic axis (forward-scattering direction) is indicated by the small white dot in the center of each
pattern. The area of each image is $(10 \times 10)\,\theta_E$. From Radtke et al. (2006), copyright Elsevier

A near-parallel incident beam (small convergence angle α) is needed to obtain
good momentum resolution but this implies a limit on the spatial resolution:
$\Delta x \approx 0.6\lambda/\alpha$, as given by the Rayleigh criterion (or the uncertainty principle).
Using Bragg’s law ($\lambda = 2d_{hkl}\theta_B = d_{hkl}\theta$), the fractional angular or momentum
resolution is (Egerton, 2007)


$$\Delta q/q_B = \Delta\theta/\theta = 2\alpha/\theta = (1.2\lambda/\Delta x)(d_{hkl}/\lambda) \approx 1.2\,d_{hkl}/\Delta x \tag{3.171}$$


where $q_B$ is the wave number corresponding to a Brillouin-zone boundary and $d_{hkl}$ is
the corresponding lattice-plane spacing. For 10% momentum resolution, the spatial
resolution is limited to $\Delta x \approx 12\,d_{hkl}$, amounting to 1 nm or more for a typical
material and crystallographic direction. Midgley (1999) described a method for
obtaining angular resolution down to a few microradians by using an aperture of
diameter d in the TEM image plane. It involves raising the specimen a distance z
above the focus plane of the electron probe, resulting in a spatial resolution limit of
$\Delta x \approx 2\alpha z$, where α is the convergence semi-angle of the probe. From the Rayleigh
criterion, the smallest useful aperture diameter is $d \approx 0.6\lambda/\alpha$, giving an angular
resolution $\Delta\theta = d/z \approx 0.6\lambda/(\alpha z) \approx 1.2\lambda/\Delta x$ and a fractional resolution
$(\Delta\theta)/\theta = (1.2\lambda/\Delta x)(d_{hkl}/\lambda) \approx 1.2\,d_{hkl}/\Delta x$, consistent with Eq. (3.171).
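
The trade-off expressed by Eq. (3.171) can be put into numbers with a few lines of code. This is only a sketch: the function names are hypothetical, and the lattice spacing (0.2 nm) and electron wavelength (3.7 pm, roughly that of 100 keV electrons) in the example are assumed values.

```python
import math


def min_probe_size_nm(d_hkl_nm, fractional_resolution):
    """Spatial resolution implied by Eq. (3.171): dx = 1.2 d_hkl / (dq/q_B)."""
    return 1.2 * d_hkl_nm / fractional_resolution


def convergence_semi_angle(wavelength_nm, probe_size_nm):
    """Rayleigh-limited convergence semi-angle (rad): alpha = 0.6 lambda / dx."""
    return 0.6 * wavelength_nm / probe_size_nm


if __name__ == "__main__":
    d_hkl = 0.2          # nm, assumed lattice-plane spacing
    wavelength = 0.0037  # nm, assumed electron wavelength
    for fraction in (0.05, 0.1, 0.2):
        dx = min_probe_size_nm(d_hkl, fraction)
        alpha = convergence_semi_angle(wavelength, dx)
        print(f"dq/qB = {fraction:4.2f}: dx = {dx:4.1f} nm, "
              f"alpha = {alpha * 1e3:.2f} mrad")
```

Halving the fractional momentum resolution doubles the required probe size, so momentum-resolved measurements at 10% resolution are restricted to probes of roughly a nanometer or larger.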

Although the orientation dependence of the energy-loss spectrum can be advantageous,
it is also useful to collect a spectrum that is characteristic of the specimen
and independent of its orientation. This can be done by using a collection semi-angle
β equal to the so-called magic angle $\theta_m$. For a hexagonal layer material
such as graphite, this means that the two components σ and π are suitably averaged,
such that the core-loss spectrum is independent of the specimen orientation
Γ (Fig. 3.60a). In fact, this spectrum is the same as that of a polycrystalline material
with grains randomly oriented. The value of $\theta_m$ is small: early estimates ranged
from $1.36\theta_E$ to $4\theta_E$. The latter value is now known to be correct only at very low
incident energy, and relativistic theory predicts a rapid fall in $\theta_m/\theta_E$ with increasing
incident energy; see Appendix A.


3.11 Delocalization of Inelastic Scattering 


The long-range nature of the electrostatic force responsible for inelastic scatter- 
ing imposes a basic limit on the spatial resolution obtainable in an energy-selected 
image or through small-probe analysis. This delocalization of the scattering can be 
defined as the width of the real-space distribution of scattering probability, some- 
times called an object function (Pennycook et al., 1995b) or pragmatically as the 
blurring of an inelastic image after all instrumental aberrations and elastic effects 
have been accounted for (Muller and Silcox, 1995a). 

On a classical (particle) description of scattering, delocalization is represented 
through the impact parameter b of the incident electron (Fig. 3.1). Small b implies 
a strong electrostatic force and large scattering angle, so scattering should appear 
more localized if observed using an off-axis detector (Howie, 1981; Rossouw and 
Maslen, 1984), in accord with channeling studies; see Section 5.6.1. This clas- 
sical approach was initiated by Bohr (1913), who derived an expression for the 
energy exchange E(b) between an incident electron (traveling in a straight line with
speed v) and a bound atomic electron, represented as a classical oscillator with
angular frequency ω:


$$E(b) \propto (\omega/v)^2\,\{[K_0(b\omega/v)]^2 + [K_1(b\omega/v)]^2\} \tag{3.172}$$


where $K_0$ and $K_1$ are modified Bessel functions of the second kind. For $b \ll v/\omega$,
$E(b) \propto 1/b^2$; in fact, the energy exchange is just the Rutherford
recoil energy that would be imparted to a stationary and free electron, since the
interaction time (≈ b/v) is short compared to the oscillation period 2π/ω of the
atomic electron. For $b \gg v/\omega$, $E(b) \propto \exp(-2b\omega/v)$; now the atomic electron
has time to adjust to the electric field, resulting in little energy exchange (adiabatic
conditions). According to these classical arguments, inelastic scattering is therefore
confined to impact parameters below $b_{\max} \approx v/\omega$. The classical theory was further
developed and made relativistic by Jackson (1975), who gives $b_{\max} = \gamma v/\omega$.
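
Since ω = E/ħ for an energy loss E, the adiabatic limit can be evaluated directly as ħv/E (or γħv/E in the relativistic form). The sketch below does this; the function name and the 100 keV incident energy used in the example are assumptions for illustration.

```python
import math

M0C2_EV = 510998.95         # electron rest energy, eV
HBAR_C_EV_NM = 197.3269804  # hbar times c, eV nm


def adiabatic_limit_nm(energy_loss_ev, e0_ev, relativistic=True):
    """Bohr adiabatic limit b_max = v/omega (or gamma*v/omega), in nm."""
    gamma = 1.0 + e0_ev / M0C2_EV
    beta = math.sqrt(1.0 - 1.0 / gamma ** 2)
    b_max = beta * HBAR_C_EV_NM / energy_loss_ev  # hbar*v/E
    return gamma * b_max if relativistic else b_max


if __name__ == "__main__":
    for loss in (1.0, 15.0, 100.0, 500.0):
        b_rel = adiabatic_limit_nm(loss, 100e3)
        b_cls = adiabatic_limit_nm(loss, 100e3, relativistic=False)
        print(f"E = {loss:6.1f} eV: v/omega = {b_cls:8.3f} nm, "
              f"gamma*v/omega = {b_rel:8.3f} nm")
```

For 100 keV electrons this gives roughly 129 nm divided by the energy loss in eV for the relativistic form, so the limit falls from several nanometers at plasmon energies to a fraction of a nanometer for core losses of a few hundred eV.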

The wave nature of the electron can be introduced by applying the Heisenberg
uncertainty principle to the scattering event: $\Delta p\,\Delta x \approx h$, where Δp represents
momentum uncertainty in the x-direction (perpendicular to the incident electron
beam) and Δx is interpreted as the x-delocalization. An early suggestion (Howie,
1981) was to take $\Delta p = \hbar q \approx \hbar k_0\theta_E = (h/\lambda)\theta_E$, giving $\Delta x = \lambda/\theta_E$. However,
this leads to delocalization of 20-50 nm for plasmon losses, now thought to be too
large. In fact, $\theta_E$ is not a typical inelastic scattering angle; the mean and median
angles are an order of magnitude larger (see Section 3.3.1), leading to smaller Δx.
This argument is developed further in Section 5.5.3. Here we present a simplified
wave-optical treatment of delocalization that accounts for the essential features of
the object or point-spread function associated with inelastic scattering.
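
The size of the early estimate Δx = λ/θ_E is readily reproduced. The following sketch uses the relativistic electron wavelength and characteristic angle; the function names, the 100 keV incident energy, and the 15 to 30 eV range taken to represent plasmon losses are assumptions for illustration.

```python
import math

M0C2_EV = 510998.95     # electron rest energy, eV
HC_EV_NM = 1239.841984  # Planck constant times c, eV nm


def wavelength_nm(e0_ev):
    """Relativistic electron wavelength for kinetic energy e0_ev."""
    return HC_EV_NM / math.sqrt(e0_ev * (e0_ev + 2.0 * M0C2_EV))


def characteristic_angle(energy_loss_ev, e0_ev):
    """theta_E = E / (gamma * m0 * v^2), in radians."""
    gamma = 1.0 + e0_ev / M0C2_EV
    beta_sq = 1.0 - 1.0 / gamma ** 2
    return energy_loss_ev / (gamma * M0C2_EV * beta_sq)


def uncertainty_delocalization_nm(energy_loss_ev, e0_ev):
    """Early uncertainty-principle estimate: dx = lambda / theta_E."""
    return wavelength_nm(e0_ev) / characteristic_angle(energy_loss_ev, e0_ev)


if __name__ == "__main__":
    for loss in (15.0, 20.0, 30.0):
        dx = uncertainty_delocalization_nm(loss, 100e3)
        print(f"E = {loss:4.1f} eV: lambda/theta_E = {dx:5.1f} nm")
```

Replacing θ_E by a mean or median scattering angle an order of magnitude larger reduces the estimate in proportion.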

In light optics, the far-field (Fraunhofer) diffraction pattern represents a Fourier
transform of the diffracting object. A familiar example is a circular hole (diameter a)
in an opaque screen, whose aperture function is a rectangular (top-hat) function. The
two-dimensional Fourier transform (representing amplitude at a distance r from the
optic axis, measured at a large distance R) is then of the form $J_1(x)/x$, where $J_1$ is
a first-order Bessel function, $x = k_0 a r/R$, and $k_0 = 2\pi/\lambda$ is the wave number of
the radiation. When squared, this amplitude gives an Airy function whose radius
(measured to the first zero of intensity) corresponds to $k_0 a r/R = 3.83$. Writing
r/R = sin θ, where θ is the angle of diffraction (small for $\lambda \ll a$), the width of
the scattering object is seen to be


$$a = 3.83(\lambda/2\pi)/\sin\theta \approx 0.61\lambda/\theta \tag{3.173}$$


Equation (3.173) indicates that the angular width of scattering is inversely proportional
to the size of the diffracting object; it coincides with the Rayleigh resolution
limit of an optical system of angular aperture θ, for the case of incoherent radiation.
Somewhat larger values of the coefficient in Eq. (3.173) are expected for coherent
or partially coherent illumination (Born and Wolf, 2001).
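
The numerical factors 3.83 and 0.61 in Eq. (3.173) come from the first zero of the first-order Bessel function. A minimal sketch that recovers them with the standard library alone is given below; it evaluates $J_1$ from Bessel's integral with a midpoint rule and locates the zero by bisection. The function names, the bracketing interval, and the step counts are choices made for this illustration.

```python
import math


def bessel_j1(x, steps=400):
    """J1(x) from Bessel's integral (1/pi) * int_0^pi cos(tau - x sin tau) dtau."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        tau = (i + 0.5) * h  # midpoint rule
        total += math.cos(tau - x * math.sin(tau))
    return total * h / math.pi


def first_zero_of_j1(lo=3.0, hi=4.5, tol=1e-10):
    """First positive zero of J1, found by bisection on [lo, hi]."""
    f_lo = bessel_j1(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = bessel_j1(mid)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    zero = first_zero_of_j1()
    print(f"first zero of J1: {zero:.4f}")
    print(f"Rayleigh coefficient zero/(2*pi): {zero / (2.0 * math.pi):.4f}")
```

Where SciPy is available, a library Bessel routine would normally be preferred over this hand-rolled integral.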

More generally, the amplitude distribution at the exit plane of a scattering object
(the aperture function in Fourier optics) is


$$A(\mathbf{r}) \propto \mathrm{FT}_2[d\psi/d\Omega] \tag{3.174}$$


where $\mathrm{FT}_2$ represents a two-dimensional Fourier transform and $d\psi/d\Omega$ is the
scattered amplitude per unit solid angle. For a single scattering object, such as an atom,
the product A(r)A*(r) can be regarded as an object or point-spread function (PSF)
that represents the image-intensity distribution in the case of an ideal lens system
and parallel (plane-wave) illumination. If we assume A(r) = A*(r), neglecting any
change in phase with scattering angle, the PSF can be obtained from


$$\mathrm{PSF}(r) \propto A(r)A^*(r) \propto \{\mathrm{FT}_2[(dI/d\Omega)^{1/2}]\}^2 \tag{3.175}$$


with $dI/d\Omega$ the scattered intensity per unit solid angle. For radially symmetric
scattering, the two-dimensional Fourier transform can be written in terms of a single
integral involving a $J_0$ Bessel function.

Applying Eq. (3.175) to the inelastic scattering of fast electrons (Shuman et al.,
1986), where $(dI/d\Omega) \propto (\theta^2 + \theta_E^2)^{-1}$ is a good approximation over most of the
angular range, the Fourier transform has an analytical solution:


$$\mathrm{PSF}(r) \propto (k_0 r)^{-2}\exp(-2k_0\theta_E r) = (k_0 r)^{-2}\exp(-2r/b_{\max}) \tag{3.176}$$


In other words, the PSF is an inverse square function multiplied by an exponential
term that introduces an additional factor of $1/e^2 = 0.135$ at $r = b_{\max}$, where
$b_{\max} = (k_0\theta_E)^{-1} = v/\omega$ is the Bohr adiabatic limit. This exponential attenuation was
verified by Muller and Silcox (1995a), who measured the intensity beyond the edge of
an oxidized SiO2 film; see Fig. 3.61. They obtained $b_{\max} = b_1 E^{-1.0064}$ with energy
loss E in eV and $b_1$ = 125 nm ($b_1$ = 129 nm expected if $b_{\max} = \gamma v/\omega$). This $r^{-2}$
dependence at lower r is consistent with hydrogenic calculations of Ritchie (1981)
and Wentzel-potential calculations of Rose (1973), summing over all energy loss.

Owing to the decrease in oscillator strength, the angular distribution of inelastic
scattering falls off more rapidly than the Lorentzian function at large scattering
angles, which removes the singularity in Eq. (3.176) at r = 0. For atomic transitions,
this cutoff occurs not far from the Bethe ridge angle $\theta_r \approx (E/E_0)^{1/2} \approx (2\theta_E)^{1/2}$.
Using Eq. (3.175) with $dI/d\Omega \propto (\theta^2 + \theta_E^2)^{-1}(\theta^2 + \theta_r^2)^{-1}$ gives a PSF with a
sharp central peak and tails that extend over atomic dimensions for typical core-loss
scattering, or stretch beyond 1 nm for valence-electron losses, as shown in Fig. 3.62.

The inelastic object function can also be calculated from first principles (Maslen
and Rossouw, 1983, 1984; Kohl and Rose, 1985; Muller and Silcox, 1995a; Cosgriff
et al., 2005; Findlay et al., 2005; Xin et al., 2010). In general, it appears to
approximate to a Lorentzian function with exponentially attenuated tails:

$$\mathrm{PSF}(r) \propto (1 + r^2/r_0^2)^{-1}\exp(-2k_0\theta_E r) \tag{3.177}$$

where $r_0$ is inversely related to the cutoff $\theta_r$ in the Lorentzian angular distribution.
For low energy losses, $\theta_r \gg \theta_E$ and the radial dependence is close to $r^{-2}$ over a
considerable range, a large fraction of the intensity being present in the PSF tails;
see Fig. 3.62. At high energy loss, $\theta_r/\theta_E$ is smaller and more of the intensity lies
within the central peak.
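
How the intensity divides between the central peak and the tails of Eq. (3.177) can be explored by integrating the PSF over area. The sketch below does so with a simple midpoint rule. It rests on assumptions that go beyond the text: the exponential length is written as $b_{\max} = (k_0\theta_E)^{-1}$, the ratio $b_{\max}/r_0$ is treated as a stand-in for $\theta_r/\theta_E$ (that is, $r_0$ is assumed equal to $1/(k_0\theta_r)$), and the two ratios used in the example are arbitrary illustrative values.

```python
import math


def psf(r, r0, b_max):
    """Unnormalised point-spread function of Eq. (3.177)."""
    return math.exp(-2.0 * r / b_max) / (1.0 + (r / r0) ** 2)


def encircled_fraction(radius, r0, b_max, steps=100000):
    """Fraction of the area-integrated PSF that lies within `radius`.

    The total is approximated by integrating out to 20*b_max, beyond
    which the exponential factor makes the remainder negligible.
    """
    def radial_integral(upper):
        h = upper / steps
        total = 0.0
        for i in range(steps):
            r = (i + 0.5) * h          # midpoint rule
            total += psf(r, r0, b_max) * r
        return total * h               # the common factor 2*pi cancels

    return radial_integral(radius) / radial_integral(20.0 * b_max)


if __name__ == "__main__":
    r0 = 1.0  # work in units of r0
    for ratio in (100.0, 5.0):  # assumed stand-ins for theta_r/theta_E
        frac = encircled_fraction(r0, r0, ratio * r0)
        print(f"b_max/r0 = {ratio:5.1f}: fraction within r0 = {frac:.2f}")
```

Under these assumptions the fraction of intensity inside the central peak is markedly smaller for the larger ratio, in line with the statement that the tails dominate at low energy loss.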

The above equations assume that all angles of scattering are recorded. If the 
imaging system contains an angle-limiting aperture, the angular width of scatter- 
ing is reduced and the PSF is broadened. For a very small collection aperture, the 
angular distribution becomes rectangular and the PSF width should be given by 
Eq. (3.173). For energy losses far above a major ionization edge, the angular dis- 
tribution of inelastic scattering departs from Lorentzian, becoming a Bethe ridge 
(Fig. 3.36), which should lead to a more localized signal (Kimoto et al., 2008). 

The delocalization of inelastic scattering has various consequences for TEM-EELS
analysis. When a small (sub-nanometer) electron probe is incident on a specimen, the
energy-loss spectrum will contain contributions from outside the probe, especially at
low energy loss where the angular distribution of inelastic scattering is narrow and
the PSF correspondingly broad. Aloof excitation of surface plasmons (Section 3.3.6)
represents a special case of this. In the case of energy-selected elemental maps,
atomic resolution is possible only for higher-energy edges, where θ_E exceeds about
2 mrad and the width of the central peak of the PSF is below 0.2 nm. For high-Z
elements, the average energy loss amounts to several hundred electron volts and a
considerable fraction of the intensity occurs within the central peak of the PSF,
allowing the possibility of secondary electron imaging of single atoms (Inada et al.,
2011).

In the case of crystalline specimens thicker than a few nanometers, a detailed 
description of core-loss imaging requires a more sophisticated treatment. Inelastic 
scattering is represented in terms of a nonlocal potential W(r, r'), a function of
two independent spatial coordinates (r and r'), and related to a density matrix
(Schattschneider et al., 1999) or mixed dynamical form factor, MDFF (Kohl and Rose,
1985; Schattschneider et al., 2000). The MDFF represents a generalization of the
dynamic form factor, necessary in crystals because the inelastically scattered waves
are mutually coherent and interfere with each other. Equation (3.20) then becomes

$$ \sigma \propto \iiint \psi_0^*(\mathbf{r}, z)\, W(\mathbf{r}, \mathbf{r}')\, \psi_0(\mathbf{r}', z)\, d\mathbf{r}\, d\mathbf{r}'\, dz \tag{3.178} $$

where the integrations are over radial coordinates perpendicular to the incident beam
direction z and over specimen thickness (0 < z < t). Equation (3.178) incorporates
the effect of the phase of the transmitted electron and its diffraction by the specimen,
making the cross section sensitive to the angle between the electron and the crystal.
If the spectrometer collection aperture cuts off an appreciable part of the scatter- 
ing, the inelastic intensity is not in general proportional to the z-integrated current 
density, implying that energy-filtered STEM images cannot always be interpreted 
visually and may require computer modeling to be understood on an atomic scale 
(Oxley and Pennycook, 2008; Wang et al., 2008c). 

One feature appearing in such calculations is a volcano or donut structure (a dip 
in intensity at the center of an atom or atomic column), which arises in the case of 
a limited collection angle because electrons incident at the atomic center are scat- 
tered preferentially to higher angles and are intercepted by the collection aperture 
(D'Alfonso et al., 2008). The fact that an off-axis detector provides a more localized
inelastic signal was verified experimentally by Muller and Silcox (1995a). The
practical importance of delocalization for elemental analysis, in combination with 
other resolution-limiting factors, is discussed further in Section 5.5. 

Simplifying the situation by treating elastic and inelastic scattering separately,
we can expect a reasonable probability of an electron undergoing both types of
scattering, unless the specimen is ultrathin (<10 nm). As elastic scattering involves
relatively high angles and is more localized, contrast with high spatial frequency can
therefore occur in an inelastic image. Examples include the appearance of diffraction
contrast in a plasmon-loss image (Egerton, 1976c) and phase-contrast lattice
fringes in a core-loss image (Craven and Colliex, 1977). Since it arises from double
scattering, the elastic contrast should diminish as the specimen thickness becomes
very small, unlike other forms of dark-field contrast.

One further aspect of delocalization is the extent to which core-loss fine structure 
changes with position in the specimen. This structure can be regarded as aris- 
ing from interference with electron waves backscattered from neighboring atoms 
(Section 3.8.2). Multiple scattering calculations indicate that backscattering is
significant within a diameter of 1-2 nm but that contributions from more distant atomic
shells have almost random phase and contribute little to the interference, so the
fine-structure oscillations can change on a sub-nanometer scale (Wang et al., 2008a).
ELNES measurements across a sharp grain boundary appear to confirm this prediction;
see Fig. 3.63. This delocalization can be regarded as a separate limit to the
spatial resolution of ELNES data and might be added in quadrature to the value
given by Eq. (5.17), for example (Wang et al., 2008a).

Another example is shown in Fig. 3.64, where the measured and calculated fine
structure appears quite dissimilar at differently bonded carbon atoms, located within
a layer of graphene and at its edge (Suenaga and Koshino, 2010). These results
indicate that core-loss fine structure may be very localized in some structures,
allowing bonding information to be obtained from a single atom if radiation damage
permits. In this case, the STEM was operated at 60 kV, below the threshold for
displacement damage in graphene (for atoms located away from an edge).

Fig. 3.63 Fine structure at the oxygen K- and titanium L23 edges, measured at locations close
to a boundary between Ba0.4Sr0.6Ti0.5Nb0.5O3 and MgAl2O4 (horizontal in the 4.5 nm x 10 nm
HAADF image) and showing a transition between Ti3+ and Ti4+ valency. From Shao et al. (2011),
copyright Elsevier

Fig. 3.64 (a) ADF image of a single-graphene layer at the edge of a HOPG flake, recorded using
a 60-kV STEM fitted with DELTA corrector. The scale bar represents 0.5 nm. (b) Same image
with each C atom outlined with a bright circle and three selected atoms (green, blue, and red)
arrowed. (c) Graphene-sheet model with single-, double-, and triple-bonded atoms identified. (d)
K-edges recorded (in <1 s with 40-mrad collection angle) from the green, blue, and red atoms.
(e-h) Near-edge structure calculated using DFT for a (e) single-bonded atom at a Klein
edge, (f) double-bonded atom at a zigzag edge, (g) double-bonded atom at an armchair edge, and
(h) triple-bonded atom away from the edge, each kind of atom being identified by a pink circle
on the right. The symbol o represents an excitonic peak. Reproduced from Suenaga and Koshino
(2010), copyright Nature Publishing


Chapter 4 
Quantitative Analysis of Energy-Loss Data 


This chapter discusses some of the mathematical procedures used to extract 
quantitative information about the crystallographic structure and chemical compo- 
sition of a TEM specimen. Although this information is expressed rather directly 
in the energy-loss spectrum, plural scattering complicates the data recorded from 
specimens of typical thickness. Therefore it is often necessary to remove the effects 
of plural scattering from the spectrum or at least make allowance for them in the 
analysis procedure. 

We start with the low-loss region, which might be defined as energy losses 
below 50 or 100 eV. Within this region, the main energy-loss mechanism involves 
excitation of outer shell electrons: the valence electrons or (in a metal) conduc- 
tion electrons. In many solids, a plasmon model (Section 3.3.1) provides the best 
description of valence electron excitation, a process that occurs with relatively high 
probability because the plasmon mean free path is often comparable with the sample 
thickness. 


4.1 Deconvolution of Low-Loss Spectra


Deconvolution techniques based on the Fourier transform will be described first,
since they are widely used. Alternative methods that are applicable to the low-loss
region are outlined in Sections 4.1.2 and 4.1.3.


4.1.1 Fourier Log Method 


Assuming independent scattering events, the electron intensity I_n, integrated over
energy loss and corresponding to inelastic scattering of order n, follows a Poisson
distribution:

$$ I_n = I P_n = (I/n!)\,(t/\lambda)^n \exp(-t/\lambda) \tag{4.1} $$

where I is the total integrated intensity (summed over all n), P_n is the probability of
n scattering events within the specimen (thickness t), ! denotes a factorial, and λ is
the mean free path for inelastic scattering.¹

R.F. Egerton, Electron Energy-Loss Spectroscopy in the Electron Microscope, 231
DOI 10.1007/978-1-4419-9583-4_4, © Springer Science+Business Media, LLC 2011

The case n=0 corresponds to the absence of inelastic scattering and is repre- 
sented in the energy-loss spectrum by the zero-loss peak: 


$$ Z(E) = I_0 R(E) \tag{4.2} $$


where the resolution or instrument response function R(E) has unit area and a full
width at half maximum (FWHM) equal to the instrumental energy resolution ΔE.

Single scattering corresponds to n=1 and is characterized by an intensity 
distribution S(E). From Eq. (4.1) 


$$ \int_0^\infty S(E)\, dE = I_1 = I\,(t/\lambda) \exp(-t/\lambda) = I_0\,(t/\lambda) \tag{4.3} $$
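
As a numerical illustration of Eqs. (4.1) to (4.3), the following Python sketch evaluates the Poisson probabilities P_n for a hypothetical specimen. The value t/λ = 0.8 and the function name are assumptions chosen only for the example; the total intensity I is normalized to unity.

```python
import math

def poisson_orders(t_over_lambda, n_max):
    """Return P_n of Eq. (4.1) for n = 0..n_max (total intensity I = 1)."""
    x = t_over_lambda
    return [x ** n * math.exp(-x) / math.factorial(n) for n in range(n_max + 1)]

# Hypothetical specimen thickness in units of the inelastic mean free path.
p = poisson_orders(0.8, 30)

print(f"zero-loss fraction I0/I = {p[0]:.4f}")
print(f"single scattering  I1/I = {p[1]:.4f}")
print(f"double scattering  I2/I = {p[2]:.4f}")
print(f"sum over all orders     = {sum(p):.6f}")

# Eq. (4.3): I1 = I0 (t/lambda), and inverting P_0 gives t/lambda = ln(I/I0).
print(f"I1/I0                   = {p[1] / p[0]:.4f}")
print(f"ln(I/I0)                = {math.log(1.0 / p[0]):.4f}")
```

The last line is the familiar log-ratio estimate of relative thickness, which follows directly from the n = 0 case of Eq. (4.1).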


Owing to the limited instrumental resolution, single scattering occurs within the
experimental spectrum J(E) as a broadened distribution J^1(E) given by

$$ J^1(E) = R(E) * S(E) = \int_{-\infty}^{\infty} R(E - E')\, S(E')\, dE' \tag{4.4} $$

where * denotes a convolution over energy loss, as defined by Eq. (4.4).

Double scattering has an energy dependence of the form S(E)*S(E). However,
the integral of this self-convolution function is (I_1)^2 = (I_0 t/λ)^2, whereas
Eq. (4.1) indicates that the integral I_2 should be I_0 (t/λ)^2/2!. Measured using an
ideal spectrometer system, the double-scattering component would therefore be
D(E) = S(E)*S(E)/(2! I_0), but as recorded by the instrument, it is

$$ J^2(E) = R(E) * D(E) = R(E) * S(E) * S(E)/(2!\, I_0) \tag{4.5} $$

Likewise, the triple-scattering contribution is equal to T(E) = S(E)*S(E)*S(E)/(3! I_0^2),
but is recorded as J^3(E) = R(E)*T(E).

The recorded spectrum, including the zero-loss peak, can therefore be written in 
the form 


$$ \begin{aligned}
J(E) &= Z(E) + J^1(E) + J^2(E) + J^3(E) + \cdots \\
     &= R(E) * [I_0 \delta(E) + S(E) + D(E) + T(E) + \cdots] \\
     &= Z(E) * [\delta(E) + S(E)/I_0 + S(E)*S(E)/(2!\, I_0^2) + S(E)*S(E)*S(E)/(3!\, I_0^3) + \cdots]
\end{aligned} \tag{4.6} $$

where δ(E) is a unit-area delta function.

¹ Here λ characterizes all inelastic scattering in the energy range over which the intensity is
integrated, and is given by Eq. (3.96) in the case where several inelastic processes contribute
within this range. As discussed in Section 3.4, Poisson statistics still apply to a spectrum
recorded with an angle-limiting aperture, provided an aperture-dependent mean free path is used.

The Fourier transform of J(E) can be defined (Bracewell, 1978; Brigham, 1974) as


$$ j(\nu) = \int_{-\infty}^{\infty} J(E) \exp(2\pi i \nu E)\, dE \tag{4.7} $$


Taking transforms of both sides of Eq. (4.6), the convolutions become products 
(Bracewell, 1978), giving the equation 


$$ j(\nu) = z(\nu)\,\{1 + s(\nu)/I_0 + [s(\nu)]^2/(2!\, I_0^2) + [s(\nu)]^3/(3!\, I_0^3) + \cdots\} = z(\nu) \exp[s(\nu)/I_0] \tag{4.8} $$

in which the Fourier transform of each term in Eq. (4.6) is represented by the equivalent
lower case symbol, and is a function of the transform variable (or “frequency”)
ν whose units are eV^-1. Equation (4.8) can be “inverted” by taking the logarithm of
both sides (Johnson and Spence, 1974), giving

$$ s(\nu) = I_0 \ln[j(\nu)/z(\nu)] \tag{4.9} $$
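
The chain from Eq. (4.6) to Eq. (4.9) can be checked numerically. The sketch below is only an illustration under simplifying assumptions: the zero-loss peak is a delta function in channel 0 (so R(E) is ideal and no noise is present), the single-scattering distribution is a made-up three-channel "plasmon" peak, and a direct O(N^2) transform without the 1/N factor is used for brevity. All names and numerical values are hypothetical.

```python
import cmath
import math

def dft(x, sign=-1):
    """Direct discrete Fourier transform, no 1/N factor (O(N^2))."""
    n_ch = len(x)
    return [sum(x[m] * cmath.exp(sign * 2j * math.pi * m * n / n_ch)
                for m in range(n_ch)) for n in range(n_ch)]

def idft_real(x):
    """Inverse of dft(); only the real part is kept because spectra are real."""
    return [v.real / len(x) for v in dft(x, sign=+1)]

def convolve(a, b):
    """Linear convolution of a and b, truncated to len(a) channels."""
    out = [0.0] * len(a)
    for i, ai in enumerate(a):
        if ai == 0.0:
            continue
        for k, bk in enumerate(b):
            if i + k < len(a):
                out[i + k] += ai * bk
    return out

N = 64
I0 = 1000.0                   # zero-loss intensity (assumed)
t_over_lambda = 0.7           # assumed relative thickness

S = [0.0] * N                 # single-scattering distribution
for m, weight in ((5, 0.25), (6, 0.5), (7, 0.25)):
    S[m] = I0 * t_over_lambda * weight      # integral obeys Eq. (4.3)

Z = [0.0] * N
Z[0] = I0                     # delta-function zero-loss peak

# Build J(E) term by term from Eq. (4.6).
J = Z[:]
term = S[:]
for order in range(1, 12):
    J = [a + b for a, b in zip(J, term)]
    term = [v / ((order + 1) * I0) for v in convolve(term, S)]

# Fourier-log deconvolution, Eq. (4.9).
j, z = dft(J), dft(Z)
s = [I0 * cmath.log(jn / zn) for jn, zn in zip(j, z)]
S_rec = idft_real(s)

max_err = max(abs(a - b) for a, b in zip(S_rec, S))
print(f"largest deviation from the input S(E): {max_err:.2e}")
```

With noise-free data and an ideal resolution function the recovery is essentially exact; the small residual comes from terms of high scattering order that run off the end of the 64-channel array.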


4.1.1.1 Noise Problems 


One might envisage taking the inverse Fourier transform of Eq. (4.9) in order to
recover an “ideal” single-scattering distribution, unbroadened by instrumental
resolution. However, as discussed by Johnson and Spence (1974) and by Egerton and
Crozier (1988), such “complete” deconvolution is feasible only if the spectrum
is coarsely sampled or noise free. In practice, J(E) contains noise (due to counting
statistics, for example) that extends to high frequencies, corresponding to large
values of ν. Although not necessarily a monotonic function of ν, the noise-free
component of j(ν) eventually falls toward zero as ν increases. As a result, the fractional
noise content in j(ν) increases with ν, and at high “frequencies” j(ν) is dominated
by noise. Because z(ν) also falls with increasing ν, the high-frequency noise content
of j(ν) is preferentially “amplified” when divided by z(ν), as in Eq. (4.9), and the
inverse transform S(E) is submerged by high-frequency noise. Essentially, Eq. (4.9)
fails because we are attempting to simulate the effect of a spectrometer system
with perfect energy resolution and to recover fine structure in J(E) that is below
the resolution limit.

Fortunately, deconvolution based on Eq. (4.9) can be made to work if we are
content to recover the instrumentally broadened single-scattering distribution J^1(E),
with little or no attempt to improve the energy resolution. Several slightly different
procedures have been employed to obtain J^1(E):


(a) Gaussian modifier. If Eq. (4.9) is multiplied by a function g(ν) that equals
unity at ν = 0 (so that its inverse transform has unit area) and falls rapidly with
increasing ν, the high-frequency values of ln(j/z) are attenuated and noise
amplification is controlled. The inverse transform then corresponds to G(E)*S(E),
the single-scattering distribution recorded with an instrument whose resolution
function is G(E), known as a modification or reconvolution function. A sensible
choice is the unit-area Gaussian G(E) = (σ√π)^-1 exp(-E^2/σ^2), whose FWHM is
W = 2σ√(ln 2) = 1.665σ, in which case the single-scattering distribution (SSD) is
obtained as the inverse transform of

$$ j^1(\nu) = g(\nu) s(\nu) = I_0 \exp(-\pi^2 \sigma^2 \nu^2) \ln[j(\nu)/z(\nu)] \tag{4.10} $$

A limited improvement in energy resolution is possible by choosing σ such
that W < ΔE, but at the expense of increased noise content. If σ is chosen
so that W = ΔE, the inverse transform J^1(E) is the SSD that would
be recorded using an instrument having the same energy resolution ΔE but
with a symmetric (Gaussian) resolution function. Besides removing plural
scattering from the measured data, procedure (a) therefore corrects for any
distortion of peak shapes caused by a skew or irregularly shaped instrument
function.


(b) Zero-loss modifier. If G(E) = R(E), Eq. (4.10) becomes

$$ j^1(\nu) = r(\nu) s(\nu) = z(\nu) \ln[j(\nu)/z(\nu)] \tag{4.11} $$

Taking the inverse transform gives the single-scattering distribution J^1(E) that
would be recorded from a vanishingly thin specimen. In Eq. (4.11) we are
using the zero-loss peak as the modification function and the easiest way of
obtaining Z(E) is from the experimental spectrum J(E), setting channel contents
to zero above the zero-loss peak. For thicker specimens, it may be more
accurate to record Z(E) in a second acquisition with no specimen present;
any difference in height between the two zero-loss peaks can be shown to result
in artifacts that are confined to the zero-loss region (Johnson and Spence,
1974).

(c) Replacement of Z(E) by a Gaussian. Approximating z(ν) both outside and
within the logarithm of Eq. (4.11) by a Gaussian function of the same width
and area, j^1(ν) can be obtained from the equation

$$ j^1(\nu) = I_0 \exp(-\pi^2 \sigma^2 \nu^2)\,\{\ln[j(\nu)] - \ln[I_0] + \pi^2 \sigma^2 \nu^2\} \tag{4.12} $$

where σ = ΔE/1.665. This procedure avoids the need to isolate or remeasure
Z(E) and calculate its Fourier transform but can generate artifacts if Z(E) is not
close to Gaussian.

(d) Replacement of Z(E) by a delta function. If the zero-loss peak Z(E) in the
original spectrum is replaced by I_0 δ(E) before calculating the transform j^d(ν), one
can use the approximation (Leapman and Swyt, 1981a):

$$ j^1(\nu) \approx I_0 \ln[j^d(\nu)/I_0] \tag{4.13} $$

As in (c), only one forward and one inverse transform are needed and noise
amplification is avoided. However, the use of Eq. (4.13) is equivalent to treating
the experimental spectrum as if it had been recorded using a spectrometer system
with perfect energy resolution and the resulting SSD will differ somewhat
from that derived using Eq. (4.11) if J(E) contains sharp peaks, comparable in
width to the instrumental resolution (Egerton et al., 1985).
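
The Gaussian modifier of procedure (a) is simple to evaluate. The following sketch, with hypothetical names and an assumed resolution of 1 eV, checks two properties used above: that the unit-area Gaussian G(E) has a FWHM of 1.665σ, and that its Fourier transform is the factor exp(-π^2 σ^2 ν^2) appearing in Eq. (4.10). The transform is approximated by a simple sum over a fine energy grid.

```python
import math

FWHM_FACTOR = 2.0 * math.sqrt(math.log(2.0))   # W / sigma, approximately 1.665

def G(e, sigma):
    """Unit-area Gaussian reconvolution function G(E)."""
    return math.exp(-(e / sigma) ** 2) / (sigma * math.sqrt(math.pi))

def g(nu, sigma):
    """Fourier transform of G(E): the modifier that multiplies ln[j/z] in Eq. (4.10)."""
    return math.exp(-(math.pi * sigma * nu) ** 2)

delta_e = 1.0                     # desired FWHM in eV (assumed value)
sigma = delta_e / FWHM_FACTOR     # sigma = W / 1.665

# G(E) should fall to half of its peak value at E = W/2.
half_height = G(delta_e / 2.0, sigma) / G(0.0, sigma)

# Numerical transform of G(E) at a single frequency nu (in eV^-1).
nu = 0.5
step = 0.01                       # eV
energies = [-10.0 + k * step for k in range(2001)]
numeric = sum(G(e, sigma) * math.cos(2.0 * math.pi * nu * e) for e in energies) * step
analytic = g(nu, sigma)

print(f"W / sigma          = {FWHM_FACTOR:.4f}")
print(f"G(W/2) / G(0)      = {half_height:.4f}")
print(f"numerical g(nu)    = {numeric:.6f}")
print(f"analytical g(nu)   = {analytic:.6f}")
```

Because G(E) is even, only the cosine part of the transform is needed, so the sign convention of the exponent in Eq. (4.7) does not affect the result.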


4.1.1.2 Practical Details 


The Fourier transforms j(ν) and z(ν) are (in general) complex numbers of the form
j_1 + i j_2 and z_1 + i z_2, where i = (-1)^(1/2). Therefore we have

$$ \frac{j(\nu)}{z(\nu)} = \frac{j_1 z_1 + j_2 z_2 + i\,(j_2 z_1 - j_1 z_2)}{z_1^2 + z_2^2} \tag{4.14} $$

$$ \ln[j(\nu)/z(\nu)] = \ln r + i\theta \tag{4.15} $$

where

$$ r = \frac{[(j_1 z_1 + j_2 z_2)^2 + (j_2 z_1 - j_1 z_2)^2]^{1/2}}{z_1^2 + z_2^2} \tag{4.16} $$

and

$$ \theta = \tan^{-1}\left(\frac{j_2 z_1 - j_1 z_2}{j_1 z_1 + j_2 z_2}\right) \tag{4.17} $$

If necessary, Eqs. (4.14), (4.15), (4.16), and (4.17) can be used to evaluate
Eqs. (4.10) and (4.11) without the use of complex functions.
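
A short sketch of Eqs. (4.14) to (4.17) in real arithmetic is given below, compared against the complex logarithm of the Python standard library. The function name and test values are illustrative assumptions. The second example has a negative real part in the ratio j/z, where the arctangent of Eq. (4.17) differs from the full complex phase by π.

```python
import cmath
import math

def log_ratio_real(j1, j2, z1, z2):
    """Return (ln r, theta) of ln[j/z] from Eqs. (4.14)-(4.17), real arithmetic only.

    The real part of the numerator of Eq. (4.14) must be nonzero, because
    Eq. (4.17) divides by it.
    """
    denom = z1 * z1 + z2 * z2
    re = j1 * z1 + j2 * z2            # real part of the numerator in Eq. (4.14)
    im = j2 * z1 - j1 * z2            # imaginary part of the numerator
    ln_r = math.log(math.hypot(re, im) / denom)   # Eq. (4.16)
    theta = math.atan(im / re)                    # Eq. (4.17): between -pi/2 and pi/2
    return ln_r, theta

# Case 1: the ratio j/z has a positive real part; both routes agree.
ln_r_a, theta_a = log_ratio_real(3.0, 2.0, 2.0, 0.5)
ref_a = cmath.log(complex(3.0, 2.0) / complex(2.0, 0.5))
print(ln_r_a, theta_a, ref_a)

# Case 2: negative real part; ln r still agrees but the phases differ by pi.
ln_r_b, theta_b = log_ratio_real(-3.0, 2.0, 2.0, 0.5)
ref_b = cmath.log(complex(-3.0, 2.0) / complex(2.0, 0.5))
print(ln_r_b, theta_b, ref_b)
```

In a practical routine a two-argument arctangent (`math.atan2(im, re)`) would extend the phase range to that of the complex logarithm; the single-argument form is kept here to mirror Eq. (4.17) as written.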

In practice, spectral data are held in a limited number N of “channels,” each
corresponding to electronic storage of a binary number. In this case j(ν) is a discrete
Fourier transform (DFT), defined (Bracewell, 1978) by

$$ j(n) = N^{-1} \sum_{m=0}^{N-1} J(m) \exp(-2\pi i\, mn/N) \tag{4.18} $$

where J(m) is the spectral intensity stored in data channel m (m being linearly related
to energy loss) and the integer n replaces ν as the Fourier “frequency.”² Because of
the sampled nature of the recorded data and its finite energy range, J(E) can be
completely represented in the Fourier domain by a limited number of frequencies,
not exceeding n = N - 1. Moreover, the spectral data J(E) are real (no imaginary
part), so that j_1(-ν) = j_1(ν), j_2(-ν) = -j_2(ν), and j_2(0) = 0 (Bracewell, 1978).
Sometimes the negative frequencies are stored in channels N/2 to N, so these relations
become j_1(N - n) = j_1(n), j_2(N - n) = -j_2(n), and j_2(0) = 0. As a result,
only (N/2 + 1) real values and N/2 imaginary values of j(n) need be computed and
stored, requiring a total of N + 1 storage channels for each transform. The requirement
becomes just N channels if the zero-frequency value j_1(0), representing the
“dc” component of J(E), is discarded (it can be added back at the end, after taking
the inverse transform). The fact that the maximum recorded frequency is n = N/2
(the Nyquist frequency) means that frequency components in excess of this value
ought to be filtered from the data before computing the DFT (Higgins, 1976) in
order to prevent spurious high-frequency components appearing in the SSD (aliasing).
In EELS data, however, this filtering is rarely necessary because frequencies
exceeding N/2 consist mainly of noise (the spectra are oversampled).
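
The symmetry relations for real data can be verified directly from Eq. (4.18). The sketch below uses a made-up 16-channel spectrum and a direct (slow) evaluation of the sum; the function name is an assumption for illustration.

```python
import cmath
import math

def dft_418(J):
    """Discrete Fourier transform as written in Eq. (4.18), including the 1/N factor."""
    N = len(J)
    return [sum(J[m] * cmath.exp(-2j * math.pi * m * n / N) for m in range(N)) / N
            for n in range(N)]

# Made-up real spectrum held in N = 16 channels.
J = [0.0, 2.0, 9.0, 30.0, 12.0, 5.0, 3.0, 2.0, 1.5, 1.0, 0.5, 0.2, 0.0, 0.0, 0.0, 0.0]
N = len(J)
j = dft_418(J)

# Largest violation of j1(N - n) = j1(n) and j2(N - n) = -j2(n).
sym_err = max(max(abs(j[N - n].real - j[n].real), abs(j[N - n].imag + j[n].imag))
              for n in range(1, N // 2))

print(f"j1(0) = {j[0].real:.4f} (mean channel content = {sum(J) / N:.4f})")
print(f"j2(0) = {j[0].imag:.1e}")
print(f"largest symmetry violation = {sym_err:.1e}")
```

With the 1/N factor of Eq. (4.18), the zero-frequency coefficient j_1(0) equals the mean channel content, which is the “dc” component referred to above.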

Although the limits of integration in Eq. (4.7) extend to infinity, the finite range
of the recorded spectrum will have no deleterious effect provided J(E) and its derivatives
have the same value at m = 0 and at m = N - 1. In this case, J(E) can be thought
of as being part of a periodic function whose Fourier series contains cosine and sine
coefficients that are the real and imaginary parts of j(n). The necessary “continuity
condition” is satisfied if J(E) falls practically to zero at both ends of the recorded
range. If not, J(E) should be extrapolated smoothly to zero at m = N - 1, using (for
example) a cosine bell function: A[1 - cos r(N - m - 1)], where r and A are constants
chosen to match the data near the end of the range. Any discontinuity in J(E)
creates unwanted high-frequency components which, following deconvolution, give
rise to ripples adjacent to any sharp features in the SSD.
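
One possible way of applying the cosine bell is sketched below. The choice of constants is an assumption made for simplicity: r is fixed so that the bell spans exactly half a period between a chosen joining channel and the last channel, and A is set to half the recorded intensity at the joining channel. This matches the value of the data at the join but not its slope; a more careful choice of r and A would match both.

```python
import math

def cosine_bell_tail(J, m_join):
    """Replace channels m_join..N-1 of J by A[1 - cos r(N - m - 1)].

    Assumed constants: r = pi / (N - 1 - m_join) and A = J[m_join] / 2, so that the
    tail starts at the recorded value J[m_join] and reaches zero at m = N - 1.
    """
    N = len(J)
    r = math.pi / (N - 1 - m_join)
    A = J[m_join] / 2.0
    head = list(J[:m_join])
    tail = [A * (1.0 - math.cos(r * (N - m - 1))) for m in range(m_join, N)]
    return head + tail

# Made-up slowly decaying spectrum that has not reached zero at the last channel.
J_demo = [100.0 * math.exp(-m / 20.0) for m in range(64)]
tapered = cosine_bell_tail(J_demo, 48)

print(f"recorded value at join : {J_demo[48]:.3f}")
print(f"tapered value at join  : {tapered[48]:.3f}")
print(f"tapered value at end   : {tapered[-1]:.3f}")
```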

In order to record all of the zero-loss peak, the origin of the energy-loss axis must
correspond to some nonzero channel number m_0. The result of this displacement of
the origin is to multiply j(n) by the factor exp(2πi m_0 n/N). However, Z(E) usually
has the same origin as J(E), so z(n) gets multiplied by the same factor and the effects
cancel in Eq. (4.10). In Eq. (4.11), where z(ν) also occurs outside the logarithm, the
combined effect is to shift the recovered SSD to the right by m_0 channels, so that
its origin occurs in channel m = 2m_0. To avoid the need for an additional phase
shift term in Eqs. (4.12) and (4.13), J(E) must be shifted, so that the center of the
zero-loss peak occurs in the first channel (m = 0) before computing the transforms.
In that case, the left half of Z(E) must be placed in channels immediately preceding
the last one (m = N - 1).

² As an example, the DFT of the unit-area Gaussian is exp(-π^2 n^2 σ^2/N^2).

The number of data channels used for each spectrum is usually of the form
N = 2^k, where k is an integer, allowing a fast-Fourier transform (FFT) algorithm
to be used to evaluate the discrete transform (Brigham, 1974). The number of arithmetic
operations involved is then of the order of N log_2 N, rather than N^2 as in a
conventional Fourier transform program (Cochran, 1967), reducing the computation
time by a factor of typically 100 (for N = 1024). Short (<50-line) FFT subroutines
have been published (e.g., Uhrich, 1969; Higgins, 1976).
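
For illustration, a recursive radix-2 FFT can indeed be written in a few lines. The sketch below is a generic textbook Cooley-Tukey routine, not one of the published subroutines cited above; it omits the 1/N factor of Eq. (4.18) and is checked against a direct evaluation of the sum. The final lines reproduce the operation-count estimate for N = 1024.

```python
import cmath
import math

def fft(x):
    """Recursive radix-2 FFT (no 1/N factor); len(x) must be a power of 2."""
    N = len(x)
    if N == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * N
    for n in range(N // 2):
        twiddle = cmath.exp(-2j * math.pi * n / N) * odd[n]
        out[n] = even[n] + twiddle
        out[n + N // 2] = even[n] - twiddle
    return out

def dft(x):
    """Direct O(N^2) evaluation of the same transform, for comparison."""
    N = len(x)
    return [sum(x[m] * cmath.exp(-2j * math.pi * m * n / N) for m in range(N))
            for n in range(N)]

# Made-up 64-channel test data: a single Gaussian peak.
data = [math.exp(-0.5 * ((m - 20) / 3.0) ** 2) for m in range(64)]
fft_err = max(abs(a - b) for a, b in zip(fft(data), dft(data)))
print(f"largest difference between FFT and direct DFT: {fft_err:.1e}")

# Ratio of operation counts N^2 / (N log2 N) for N = 1024.
N = 1024
ratio = N * N / (N * math.log2(N))
print(f"N^2 / (N log2 N) for N = 1024: {ratio:.1f}")
```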

The zero-loss peak is absent in the inverse transform of j^1(n) but can be restored at
its correct location if an array containing Z(E) is stored separately. Z(E) can be useful
in the SSD because it delineates the zero-loss channel and provides an indication of
the specimen thickness and the energy resolution.


4.1.1.3 Thicker Samples 


Strictly speaking, Eqs. (4.9), (4.10), (4.11), (4.12), and (4.13) do not specify a
unique solution for the single-scattering distribution; it is possible to add any
multiple of 2πi to the right-hand side of Eq. (4.15) and thereby change the SSD without
affecting the quantity in square brackets (i.e., the experimental data). This ambiguity
will cause problems if the true value of the phase θ (the imaginary part of the
logarithm) lies outside the range (normally -π to +π) generated by a complex
logarithm function, a condition that is liable to occur if the scattering parameter t/λ
exceeds π (Spence, 1979). If Eq. (4.17) is used, the value of θ is restricted to the
range -π/2 to π/2 and trouble may arise when t/λ > π/2.

A solution to this phase problem (Spence, 1979) is to avoid making use of the
imaginary part of the logarithm. This would happen automatically if S(E) were an
even function (symmetric about m = N/2 or m = 0), since in this case s(ν) has
no imaginary coefficients (Bracewell, 1978). In practice, S(E) is not even but can
always be written as a sum of its even and odd parts: S(E) = S^+(E) + S^-(E). If S(E)
is zero over one-half of its range (see Fig. 4.1), S(E) = 2S^+(E) over the other half
and it is sufficient to recover S^+(E), thereby avoiding the phase problem. The
necessary condition is satisfied by doubling the length of the array, shifting the
spectrum J(E) (before computing its Fourier transform), so that the middle of the
zero-loss peak occurs either at the exact center or at the beginning of the data range,
then making sure that the unused array elements are set to zero. In the former case
J(E) = 0 for m < N/2 (energy gains can generally be neglected; Section 3.3.6); in the
latter, J(E) = 0 for m > N/2. In either case, the effects of truncation errors can be
avoided by terminating the data smoothly, as discussed earlier.

Fig. 4.1 Relationship between the single-scattering distribution S(E) and its even and odd parts,
S^+(E) and S^-(E), for the case where S(E) = 0 in the range N/2 < m < N

The Fourier transform s^+(ν) of S^+(E) is obtained as follows. From Eq. (4.15),

$$ \ln[j(\nu)/z(\nu)] = \ln(r) + i\theta = s^+(\nu) + s^-(\nu) \tag{4.19} $$

Since s^+(ν) is entirely real and s^-(ν) entirely imaginary,

$$ s^+(\nu) = \ln(r) = \ln[\,|j(\nu)/z(\nu)|\,] \tag{4.20} $$

As an alternative to computing the modulus in Eq. (4.20), ln(r) can be calculated
directly from Eq. (4.16). An inverse transform gives J^1(E) correctly in the
region originally occupied by the nonzero J(E) data but a mirror image of J^1(E)
appears in the other half of the range and should be discarded. Spence (1979)
has shown that a correct SSD can be obtained even from samples with t/λ = 10
using this method, which is implemented in the FLOGS program described in
Appendix B.
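
The difference between the complex logarithm and the modulus-only route of Eq. (4.20) can be seen in a small numerical experiment. The sketch below is not the FLOGS program; it is a minimal illustration under stated assumptions: I_0 = 1, a delta-function zero-loss peak in channel 0, a made-up three-channel single-scattering peak with t/λ = 5, and a plural-scattering spectrum synthesized directly in the Fourier domain from Eq. (4.8), which corresponds to periodic convolution. The data occupy the lower half of a doubled array of 256 channels.

```python
import cmath
import math

def fft(x, sign=-1):
    """Recursive radix-2 FFT without 1/N factor; sign=+1 gives the unscaled inverse."""
    N = len(x)
    if N == 1:
        return [complex(x[0])]
    even, odd = fft(x[0::2], sign), fft(x[1::2], sign)
    out = [0j] * N
    for n in range(N // 2):
        tw = cmath.exp(sign * 2j * math.pi * n / N) * odd[n]
        out[n], out[n + N // 2] = even[n] + tw, even[n] - tw
    return out

N = 256                      # doubled array; S(E) is zero for m >= N/2
I0 = 1.0
t_over_lambda = 5.0          # thick specimen (assumed value)

S = [0.0] * N
for m, weight in ((9, 0.2), (10, 0.6), (11, 0.2)):
    S[m] = I0 * t_over_lambda * weight

# Synthetic data from Eq. (4.8): j = z exp(s/I0), with z = I0 for a delta-function Z(E).
s_true = fft(S)
j = [I0 * cmath.exp(sn / I0) for sn in s_true]
z = [I0] * N

# Route 1: complex logarithm, Eq. (4.9). The phase is wrapped into (-pi, pi].
s_complex = [I0 * cmath.log(jn / zn) for jn, zn in zip(j, z)]
S_complex = [v.real / N for v in fft(s_complex, sign=+1)]

# Route 2: modulus only, Eq. (4.20), scaled by I0. This yields the even part S+(E).
s_plus = [I0 * math.log(abs(jn / zn)) for jn, zn in zip(j, z)]
S_plus = [v.real / N for v in fft(s_plus, sign=+1)]

# S = 2 S+ in the occupied half (channel 0 is its own mirror); discard the mirror image.
S_even = [2.0 * S_plus[m] if 0 < m < N // 2 else 0.0 for m in range(N)]
S_even[0] = S_plus[0]

err_complex = max(abs(a - b) for a, b in zip(S_complex, S))
err_even = max(abs(a - b) for a, b in zip(S_even, S))
print(f"largest error, complex logarithm : {err_complex:.3g}")
print(f"largest error, modulus only      : {err_even:.3g}")
```

Because t/λ exceeds π here, several Fourier coefficients have a true phase outside the range returned by the complex logarithm, so the first route fails while the second recovers the input distribution.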

An alternative way of extending deconvolution to thicker specimens is to evaluate
θ using Eq. (4.17) and instruct the computer to correct for each discontinuity
in the array, by adding or subtracting π (Egerton and Crozier, 1988). However, this
correction becomes problematic if the true change in phase between adjacent
coefficients approaches π, as is the case for thicker specimens if the energy range of the
original data is too restricted (Su and Schattschneider, 1992a).


4.1.1.4 Effect of Thickness Variations 


Our analysis has assumed that the specimen is uniform over the area from which the
spectrum J(E) is recorded. The effect of a variation in thickness can be visualized
in extreme form by imagining part of the sampled area to have zero thickness,
corresponding (for example) to a hole in the specimen. Electrons transmitted through
the hole will contribute to Z(E) but not to the inelastic intensity. Even if the
parameter t represents some average thickness, Eq. (4.1) will not be exact and the SSD
derived using the Fourier log method will be somewhat in error. It appears that double
scattering is undersubtracted, typically by 5% if Z(E) exceeds the true value
by 25%.

The effect of a small variation Δt in thickness was calculated by Johnson and
Spence (1974). As a fraction of the double-scattering intensity I_2, the residual
second-order component ΔI_2 is ΔI_2/I_2 ≈ (1/12)(Δt/t)^2, and this fraction is less
than about 1% for Δt < 0.35t.
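
The size of this residual is easily tabulated. The short sketch below evaluates (1/12)(Δt/t)^2 for a few assumed values of Δt/t and finds the relative thickness variation at which the residual reaches 1%; the function name is illustrative only.

```python
import math

def residual_double_fraction(dt_over_t):
    """Residual second-order fraction, (1/12)(dt/t)^2, for a small thickness variation."""
    return dt_over_t ** 2 / 12.0

for x in (0.1, 0.2, 0.35, 0.5):
    print(f"dt/t = {x:4.2f}  ->  residual = {100.0 * residual_double_fraction(x):5.2f} %")

# Thickness variation at which the residual equals 1% of the double scattering.
threshold = math.sqrt(12.0 * 0.01)
print(f"residual reaches 1% at dt/t = {threshold:.3f}")
```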
4.1.1.5 Effect of an Angle-Limiting Aperture


The deconvolution methods discussed above assume that the intensities recorded 
in the energy-loss spectrum obey Poisson statistics. This assumption is justified if 
the spectrometer records all of the transmitted electrons, but when an angle-limiting 
aperture precedes the spectrometer, the relative intensities of the different orders of 
scattering are altered. 

An angle-selecting aperture that is centered about the optic axis (zero scattering
angle) admits all of the unscattered electrons but accepts only a fraction F_n of those
that were inelastically scattered n times, as discussed in Section 3.4. In Eq. (4.9),
z(ν) will be unaffected by the aperture but j(ν) will be modified. Algebraic analysis
(Egerton and Wang, 1990) shows that Fourier log deconvolution leaves behind
a fraction R_2 of the double scattering and a fraction R_3 of the triple scattering,
given by

$$ R_2 = [F_2 - F_1^2]/F_2, \qquad R_3 = [F_3 - 3F_1F_2 + 2F_1^3]/F_3 \tag{4.21} $$

If Eq. (3.97) is satisfied, the bracketed terms in Eq. (4.21) are close to zero provided
β >> θ_E, where β is the aperture semi-angle and θ_E is the characteristic scattering
angle at an energy loss E. Fortunately, this condition holds for the low-loss
region, with typical aperture angles and incident energies above 50 keV; see Fig. 4.2.
Numerical computation of F_2 and F_3, assuming a Lorentzian angular distribution
with an abrupt cutoff, indicates that less than 3% of the second- and third-order
scattering remains after Fourier log processing, provided β > 10θ_E. This prediction
has been confirmed experimentally (Egerton and Wang, 1990; Su et al., 1992).
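
Equation (4.21) can be explored with a few lines of code. In the sketch below the collection fractions F_1, F_2, and F_3 are made-up numbers, not values computed from an angular distribution. The first case assumes F_n = (F_1)^n, which is taken here to be the condition expressed by Eq. (3.97); that reading is an assumption, since Eq. (3.97) is not reproduced in this section.

```python
def residual_fractions(F1, F2, F3):
    """Fractions of double (R2) and triple (R3) scattering left by Fourier log
    deconvolution when an aperture accepts fractions F1, F2, F3; Eq. (4.21)."""
    R2 = (F2 - F1 ** 2) / F2
    R3 = (F3 - 3.0 * F1 * F2 + 2.0 * F1 ** 3) / F3
    return R2, R3

# Case 1: F_n = F_1^n exactly, so the bracketed terms vanish.
R2_ideal, R3_ideal = residual_fractions(0.9, 0.9 ** 2, 0.9 ** 3)
print(f"F_n = F_1^n      : R2 = {R2_ideal:.2e}, R3 = {R3_ideal:.2e}")

# Case 2: hypothetical fractions that depart slightly from F_n = F_1^n.
R2_demo, R3_demo = residual_fractions(0.90, 0.82, 0.75)
print(f"hypothetical F_n : R2 = {R2_demo:.4f}, R3 = {R3_demo:.4f}")
```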

If the collection aperture is displaced from the optic axis, as in angular-resolved
spectroscopy, Eq. (3.97) no longer holds and the problem of calculating the SSD
becomes more complicated. The convolution integrals must be generalized to
include scattering angle θ, treated as a vector with radial and azimuthal components
(Misell and Burge, 1969). In addition, it may be necessary to deal explicitly with
elastic and quasielastic scattering (Bringans and Liang, 1981; Batson and Silcox,
1983), which can appreciably modify the angular distribution of inelastic scattering,
particularly in thicker specimens. In principle, the angular distributions of all these
scattering processes must be known, although the procedure can be simplified if an
energy-loss spectrum that includes all angles of scattering is available (Batson and
Silcox, 1983).

Fig. 4.2 Plasmon-loss spectra of aluminum before (dashed line) and after Fourier log
deconvolution, recorded using 120-keV electrons with (a) no objective lens aperture
(β ≈ 1700θ_E ≈ 120 mrad) and (b) an objective aperture of diameter 20 μm (β ≈ 54θ_E ≈ 3.7 mrad)

For an amorphous or polycrystalline specimen, both the elastic and the inelastic 
scatterings are axially symmetric and scattering probabilities (per unit angle) can be 
written in terms of Hankel transforms: Eq. (4.7) with the exponential replaced by 
a zero-order Bessel function (Johnson and Isaacson, 1988; Reimer, 1989). Su and 
Schattschneider (1992b) used discrete Hankel transforms to process plasmon-loss 
spectra recorded from 50 and 100 nm aluminum films at scattering angles up to 
13 mrad. 


4.1.2 Fourier Ratio Method 


A common situation in spectroscopy is that a recorded spectrum J(E) represents an
ideal spectrum P(E) convolved with some broadening function R(E):

$$ J(E) = R(E) * P(E) \tag{4.22} $$

Knowing J(E) and R(E), it is possible to recover P(E) by deconvolution, and an
efficient way to do this is by calculating the Fourier transforms j(ν) and r(ν), taking
a ratio j(ν)/r(ν) followed by an inverse transform. As discussed in the previous
section, this procedure is liable to generate noise problems, although these can be
controlled by the use of a reconvolution function G(E), so that an approximation to
P(E) is obtained as

$$ P(E) \approx F^{-1}[g(\nu)\, j(\nu)/r(\nu)] \tag{4.23} $$

where F^-1 denotes an inverse transform. A simple application of Eq. (4.23) is for
improving the resolution of spectral data (peak sharpening). Applied to a low-loss
spectrum, R(E) can be the instrument resolution function, recorded as the zero-loss
peak, while G(E) is a Gaussian of smaller width, resulting in a spectrum P(E)
whose energy resolution is the width of G(E) rather than that of R(E). However, this
results in a severe degradation of signal/noise ratio; the situation for coarsely sampled
data is shown in Fig. 4.3a (dashed line) and the situation becomes worse for
finely sampled data that contain higher-frequency noise components.

Fig. 4.3 (a) Signal/noise ratio as a function of the increase in energy resolution, for Fourier ratio
and maximum entropy deconvolution (Overwijk and Reefman, 2000). (b) Low-loss spectrum of
pentacene sharpened by the Richardson-Lucy method. Note the appearance of a satellite peak at
about 6 eV and oscillations throughout the spectrum as the number of iterations increases from
5 to 1500 (Egerton et al., 2006a). Copyright Elsevier

There are, however, situations in which Eq. (4.23) is useful. If R(E) is an unsymmetrical
function that distorts the shape of peaks in the spectrum, the use of a
symmetrical G(E) of the same width removes this distortion without increase in
noise, since it provides no resolution enhancement. A cold-field emission electron
source suffers from an unsymmetrical R(E) with an extended high-energy tail that
interferes with analysis below about 3 eV, for determining the bandgap in a
semiconductor, for example. This tail represents information of low spatial frequency
and can be removed without significant noise penalty, as demonstrated by Batson
et al. (1992). Another example: tails that occur in spectra recorded using a parallel
recording detector (as a result of light spreading in the scintillator) are removed
effectively by the use of Eq. (4.23).

Fourier ratio deconvolution can also be useful when a low-loss spectrum J(E) 
has been recorded from a particle on a substrate, and where the spectrum S(E) of 
the bare substrate is also available. The use of Eq. (4.23) with R(E) = S(E) then 
allows the spectrum P(E) of the particle to be generated (Wang et al., 2009a). This 
deconvolution makes allowance for plural scattering, whereas simple subtraction of
S(E) from J(E) is accurate only if each spectrum is first processed to remove plural
scattering.


4.1.3 Bayesian Deconvolution 


By incorporating extra information, including the fact that the energy-loss intensity
cannot be negative, iterative Bayesian methods offer the possibility of correcting for
spectral broadening more effectively than Fourier deconvolution. An estimate P¹ of
the true spectrum P (free from instrumental broadening) can be obtained through a
procedure that minimizes the deviation between P¹ and S, such as

where i represents channel number and σᵢ is the standard deviation of the data Jᵢ. The
problem again is that P contains noise, not represented in Eq. (4.24). The maximum-entropy
(ME) method treats noise separately from the data, ensuring that P¹ is
as smooth as possible (minimum information content) by minimizing the quantity
F = χ² + βS, where β is a regularization parameter and the entropy S is given by


$$S = \sum_i P^1_i \left\{ \log\left( P^1_i / B_i \right) - 1 \right\} \tag{4.24}$$

Here Bᵢ is a Bayesian “prior,” often taken as an average of the recorded spectrum Jᵢ
(Overwijk and Reefman, 2000). An estimate of the noise advantage of this method 
is shown in Fig. 4.3a, where the solid line represents the super resolution coefficient: 
SR = {log,[1 +0.07(SNR)’]} /3, fitted to experimental data (Overwijk and Reefman, 
2000). The ME method is clearly preferable to Fourier deconvolution in terms of 
noise amplification. 

A second Bayesian method, maximum-likelihood (ML) deconvolution, attempts
to maximize the joint probability of observing the measured data set, which is given
for uncorrelated noise as

$$p(J_1 \ldots J_N) = \prod_{i=1}^{N} p(J_i) \tag{4.25}$$


Maximization is achieved by applying the criterion d{ln[p(Jᵢ)]}/dp = 0, assuming
Poisson noise and subject to the requirement that all data remain positive and
that the integrated intensity is conserved. An iterative procedure was described
by Richardson (1972) and shown to converge to the maximum-likelihood solution
by Lucy (1974), being therefore known as the Richardson-Lucy (R-L) algorithm.
Widely used in astronomy, it is available for image processing, such as correcting
the point-spread function of a TEM camera (Zuo, 2000). Gloter et al. (2003) used
this two-dimensional procedure to correct for the spectrometer aberration figure
(recorded by a CCD camera) and thereby improve the resolution of the energy-loss
spectrum.

Applied to EELS, the ME and ML procedures require a “kernel” to act as the 
instrumental resolution function, often taken as the zero-loss peak recorded with 
no specimen. Both methods produce similar results (Kimoto et al., 2003). It may 
be preferable to use the same acquisition time for both kernel and data, to ensure 
similar noise content (Lazar et al., 2006). Usually, however, the result does not 
converge; as the number of iterations increases the spectrum starts out smooth but 
ripples build up as the energy resolution improves. These oscillations are caused 
by noise in the original spectrum and are absent in the case of noise-free data 
(Egerton et al., 2006a). As a compromise, the process is often terminated after about 
15 iterations. 
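
The Richardson-Lucy iteration itself is compact. The sketch below is a generic one-dimensional
form of the algorithm, not the MATLAB program of Appendix B; the function names, the choice
of the recorded data as the starting estimate, and the noise-free Gaussian test peak (true FWHM
of 6 channels, kernel FWHM of 8 channels) are assumptions made for illustration.

```python
import numpy as np

def richardson_lucy(data, kernel, n_iter=15):
    """Generic 1-D Richardson-Lucy iteration with a unit-area kernel."""
    kernel = np.asarray(kernel, float) / np.sum(kernel)
    mirror = kernel[::-1]                       # kernel reflected about its centre
    data = np.asarray(data, float)
    estimate = data.copy()                      # start from the recorded spectrum
    for _ in range(n_iter):
        blurred = np.convolve(estimate, kernel, mode="same")
        ratio = data / np.maximum(blurred, 1e-12)   # guard against division by zero
        estimate *= np.convolve(ratio, mirror, mode="same")
    return estimate

def fwhm_channels(y):
    """Crude width estimate: number of channels at or above half maximum."""
    return int(np.count_nonzero(y >= 0.5 * np.max(y)))

# Hypothetical test: a Gaussian peak broadened by a Gaussian kernel (odd length).
x = np.arange(-30, 31)
kernel = np.exp(-4.0 * np.log(2.0) * (x / 8.0) ** 2)
ch = np.arange(201)
true = 1.0e4 * np.exp(-4.0 * np.log(2.0) * ((ch - 100) / 6.0) ** 2)
data = np.convolve(true, kernel / kernel.sum(), mode="same")
sharp = richardson_lucy(data, kernel, n_iter=15)
```

Because each update multiplies the previous estimate by a non-negative factor, positivity is
preserved automatically, and the integrated intensity is conserved from one iteration to the next.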
Unlike the Fourier methods, these “software monochromator” algorithms can
sharpen low-loss peaks by a factor of 2-3 without noise amplification. However,
they do not remove plural scattering from the low-loss region and must be employed 
with care. From model spectra, Lazar et al. (2006) showed that the use of Bayesian 
deconvolution to improve the spectral resolution beyond that of the instrument can 
result in incorrect ratios of spectral peaks; sharpening enhances the visibility of 
small spectral features but introduces artifacts. Even for noise-free data, wing or 
satellite peaks appear adjacent to any intense peak in the spectrum, such as the zero- 
loss peak; see Fig. 4.3b. These artifacts are particularly troubling in the case of 
bandgap measurement, for which purpose a monochromated spectrometer system 
represents a safer way of obtaining the necessary energy resolution. Such problems 
are less serious when Bayesian methods are used for processing the core-loss region 
of the spectrum, where they are also capable of removing plural scattering; see 
Section 4.3.3. A MATLAB version of the Richardson-Lucy algorithm is described
in Appendix B. 


4.1.4 Other Methods 


Misell and Jones (1969) describe a deconvolution method for removing plural 
scattering from the low-loss spectrum, applying successive corrections based on 
self-convolution. Unfortunately the result converges rapidly only if the specimen is 
very thin. Schattschneider (1983a,b) advocated a modification of the above proce- 
dure, based on matrix manipulation, which is attractive for small data sets since it 
involves no truncation errors. Neither of these procedures makes explicit allowance 
for instrumental energy resolution. 

Wachtmeister and Csillag (2011) describe an iterative method, attributed to Gold 
(1964), for increasing the energy resolution of low-loss or core-loss data. It is based 
on matrix manipulation with constraints, including positivity. Although the computations
were time-consuming, the iteration was said to converge without generating
oscillations or wing peaks.


4.2 Kramers-Kronig Analysis


As shown in Chapter 3, the single-scattering spectrum S(E) is related to the complex
permittivity ε of the specimen. Ignoring instrumental broadening, surface-mode
scattering and retardation effects (Stöger-Pollach et al., 2008),

$$J^1(E) \approx S(E) = \frac{I_0 t}{\pi a_0 m_0 v^2}\, \mathrm{Im}\left[\frac{-1}{\varepsilon(E)}\right] \ln\left[1 + \left(\frac{\beta}{\theta_E}\right)^2\right] \tag{4.26}$$

where I₀ is the zero-loss intensity, t the specimen thickness, v the speed of the incident
electron, β the collection semi-angle, and θ_E = E/(γm₀v²) is the characteristic
scattering angle for an energy loss E. Note that S(E) and the instrumentally broadened
intensity J¹(E) are in units of J⁻¹; a factor of e = 1.6 × 10⁻¹⁹ is required to
convert them to eV⁻¹.

Starting from the single-scattering distribution J¹(E), Kramers-Kronig analysis
enables the energy dependence of the real and imaginary parts (ε₁ and ε₂) of
the permittivity to be calculated, together with other optical quantities such as the
absorption coefficient and reflectivity. Although a typical energy-loss spectrum has 
worse energy resolution than that achievable using light optical spectroscopy, its 
energy range is much greater: energy losses equivalent to the visible, ultraviolet, and 
soft x-ray region can be recorded in the same experiment. Moreover, the energy-loss 
data are obtainable from microscopic regions of a specimen, which can be charac- 
terized in the same instrument using other techniques such as electron diffraction. 
Such data can be helpful in formulating band structures (Fink et al., 1983) and in 
characterizing small particles (Alexander et al., 2008) or heterostructures (Turowski 
and Kelly, 1992; Lakner et al., 1999; Lo et al., 2001). 

The first step in the process is to derive the single-scattering distribution J¹(E)
from an experimental spectrum J(E), as described in Section 4.1. If the specimen is 
very thin (below 10 nm for 100-keV incident electrons), the raw spectrum might be 
used but would contain an appreciable surface-loss contribution, reducing the accu- 
racy of the method. Some workers minimize this surface contribution by recording 
the spectrum slightly off-axis, taking advantage of the smaller angular width of the 
surface losses (Liu, 1988). 


4.2.1 Angular Corrections 


The next step in K-K analysis is to obtain an energy distribution proportional to
Im[−1/ε] by dividing J¹(E) by the logarithmic term of Eq. (4.26), which has a
fairly weak E-dependence. Since θ_E ∝ E, this procedure increases the intensity at
high energy loss relative to that at low loss. Sometimes referred to as an aperture
correction, it is not quite equivalent to simulating the effect of removing the collection
aperture, which would require division of J¹(E) by the angular collection
efficiency η = ln[1 + (β/θ_E)²]/ln[1 + (θ_c/θ_E)²], where θ_c is an effective cutoff
angle (Section 3.3.1).
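
The angular correction amounts to a channel-by-channel division. A minimal sketch follows,
assuming relativistic kinematics for γm₀v² and energies expressed in eV; the function names
and the example values of incident energy and angles are illustrative assumptions.

```python
import numpy as np

M0C2_EV = 510998.95   # electron rest energy in eV

def theta_E(E_eV, E0_eV):
    """Characteristic angle theta_E = E / (gamma m0 v^2), in radians."""
    gamma = 1.0 + E0_eV / M0C2_EV
    v2_over_c2 = 1.0 - 1.0 / gamma**2
    return np.asarray(E_eV, float) / (gamma * M0C2_EV * v2_over_c2)

def aperture_correct(E_eV, J1, E0_eV, beta_rad):
    """Divide the single-scattering distribution by ln[1 + (beta/theta_E)^2]."""
    log_term = np.log(1.0 + (beta_rad / theta_E(E_eV, E0_eV)) ** 2)
    return np.asarray(J1, float) / log_term

def collection_efficiency(E_eV, E0_eV, beta_rad, theta_c_rad):
    """eta = ln[1 + (beta/theta_E)^2] / ln[1 + (theta_c/theta_E)^2]."""
    tE = theta_E(E_eV, E0_eV)
    return np.log(1.0 + (beta_rad / tE) ** 2) / np.log(1.0 + (theta_c_rad / tE) ** 2)

# Example: a flat spectrum between 5 and 100 eV, 100-keV electrons, beta = 10 mrad.
E = np.linspace(5.0, 100.0, 20)
corrected = aperture_correct(E, np.ones_like(E), 1.0e5, 10.0e-3)
```

The corrected intensity rises steadily with energy loss, as expected from the decrease of the
logarithmic term as θ_E grows.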

Equation (4.26) assumes that the angular divergence α of the incident beam is
small in comparison with β. If this condition does not hold, a further angular correction
may be required (Section 4.5). Daniels et al. (1970) give an alternative form of
correction that applies when the energy-loss intensity is measured using an off-axis 
collection aperture. 


4.2.2 Extrapolation and Normalization 


In order to evaluate subsequent integrals, the data may have to be extrapolated, so
that J¹(E) falls practically to zero at high energy loss. The form of extrapolation
is not critical; an AE⁻ʳ dependence can be used, where r is estimated from the
experimental data or taken as 3 (as predicted for the tail of a plasmon peak by the
Drude model; Section 3.3.1).
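
One way to carry out such an extrapolation is a straight-line fit in log-log coordinates. The
sketch below assumes equally spaced channels and strictly positive intensities in the fitting
region; the function name and its arguments (fit_from, E_max) are hypothetical.

```python
import numpy as np

def extrapolate_power_law(E, J, fit_from, E_max, r=None):
    """Extend J(E) to E_max with an A*E**(-r) tail fitted above fit_from.

    If r is None it is estimated from the data; otherwise the supplied value
    (for example r = 3) is used and only A is fitted.
    """
    E, J = np.asarray(E, float), np.asarray(J, float)
    mask = E >= fit_from
    logE, logJ = np.log(E[mask]), np.log(J[mask])
    if r is None:
        slope, intercept = np.polyfit(logE, logJ, 1)
        r, A = -slope, np.exp(intercept)
    else:
        A = np.exp(np.mean(logJ + r * logE))
    dE = E[1] - E[0]
    E_ext = np.arange(E[-1] + dE, E_max + 0.5 * dE, dE)
    return np.concatenate([E, E_ext]), np.concatenate([J, A * E_ext ** (-r)]), r

# Example with synthetic data that already follow an E**-3 law.
E = np.arange(1.0, 101.0)
J = 5.0e6 * E ** -3.0
E2, J2, r_fit = extrapolate_power_law(E, J, fit_from=50.0, E_max=400.0)
```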

Unless the values of t, v, and β in Eq. (4.26) are accurately known (Isaacson,
1972a), Im[−1/ε] can be put on an absolute scale by use of a Kramers-Kronig sum
rule, obtained by setting E = 0 in Eq. (4.28):

$$1 - \mathrm{Re}\left[\frac{1}{\varepsilon(0)}\right] = \frac{2}{\pi} \int_0^\infty \mathrm{Im}\left[\frac{-1}{\varepsilon(E)}\right] \frac{dE}{E} \tag{4.27}$$

Since Re[1/ε(0)] = ε₁/(ε₁² + ε₂²), the left-hand side of Eq. (4.27) can be taken as
1 for a metal, where both ε₁ and ε₂ become very large for E → 0 (see Fig. 3.11).
In the case of an insulator, ε₂ is small at small E and Re[1/ε(0)] ≈ 1/ε₁(0), where
ε₁(0) is the square of the refractive index for visible light. The static permittivity is
not appropriate here since the measured spectrum does not extend into the infrared
region (E < 0.1 eV) because of the limited energy resolution of a typical TEM-spectrometer
system.

Normalization involves dividing each energy-loss intensity, proportional to
Im[−1/ε(E)] after the plural scattering and angular corrections, by the corresponding
energy loss E and integrating over the entire energy range as in Eq. (4.27). The
resulting integral is divided by (π/2){1 − Re[1/ε(0)]} to yield the proportionality
constant K = I₀t/(πa₀m₀v²) and an estimate of the absolute specimen thickness, if
the zero-loss integral I₀ and the incident energy are known. The aperture-corrected
spectrum is then divided by K to give Im[−1/ε(E)]; see Fig. 4.4b. If Re[1/ε(0)] is
unknown, it may be possible to use Eq. (4.32) or (4.33) to estimate K, provided the
upper limit of the integral can be chosen, such that essentially all contributions from
a known set of atomic shells are included.
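
The normalization step can be sketched as follows. This is an illustration under stated
assumptions rather than a definitive implementation: the spectrum is taken to be in counts per
eV on a grid that avoids E = 0, m₀v² is evaluated relativistically in eV, and the function names
and the Drude test spectrum (plasmon energy 15 eV, width 2 eV) are hypothetical.

```python
import numpy as np

M0C2_EV = 510998.95     # electron rest energy (eV)
A0_NM = 0.0529177       # Bohr radius (nm)

def trapezoid(y, x):
    """Trapezoidal integration, written out to avoid NumPy version differences."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def m0v2_eV(E0_eV):
    """m0 v^2 in eV for incident energy E0 (relativistic v)."""
    gamma = 1.0 + E0_eV / M0C2_EV
    return M0C2_EV * (1.0 - 1.0 / gamma**2)

def kk_normalize(E, J_ac, re_inv_eps0=0.0):
    """Return Im[-1/eps] and K from the aperture-corrected spectrum, via Eq. (4.27).

    re_inv_eps0 is Re[1/eps(0)]: 0 for a metal, about 1/n^2 for an insulator.
    """
    K = trapezoid(J_ac / E, E) / (0.5 * np.pi * (1.0 - re_inv_eps0))
    return J_ac / K, K

def thickness_nm(K, I0, E0_eV):
    """Specimen thickness from K = I0 t / (pi a0 m0 v^2)."""
    return K * np.pi * A0_NM * m0v2_eV(E0_eV) / I0

# Hypothetical check: a Drude metal, 50 nm thick, 100-keV electrons.
Ep, gam, I0, E0 = 15.0, 2.0, 1.0e6, 1.0e5
E = np.arange(0.005, 500.0, 0.01)
loss = Ep**2 * E * gam / ((E**2 - Ep**2) ** 2 + (E * gam) ** 2)
K_true = I0 * 50.0 / (np.pi * A0_NM * m0v2_eV(E0))
loss_est, K_est = kk_normalize(E, K_true * loss)
t_est = thickness_nm(K_est, I0, E0)
```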


4.2.3 Derivation of the Dielectric Function 


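Based on the fact that the dielectric response function is causal (Johnson, 1975), a
Kramers-Kronig transformation can be used to derive the function Re[1/ε(E)] from
Im[−1/ε(E)]:³

$$\mathrm{Re}\left[\frac{1}{\varepsilon(E)}\right] = 1 - \frac{2}{\pi}\, P \int_0^\infty \mathrm{Im}\left[\frac{-1}{\varepsilon(E')}\right] \frac{E'\, dE'}{E'^2 - E^2} \tag{4.28}$$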
where P denotes the Cauchy principal part of the integral, avoiding the pole at E = E′
(Daniels et al., 1970). In Eq. (4.28), E′/(E′² − E²) acts as a “weighting function,”
giving prominence to energy losses E′ that lie close to E. Values of Im[−1/ε]
corresponding to E′ < E contribute negatively to the integral whereas values corresponding
to E′ > E make a positive contribution, so 1 − Re[1/ε] somewhat
resembles the differential of Im[−1/ε].

3 Equation (4.28) applies only to isotropic materials; for the anisotropic case, see Daniels et al.
(1970).

Fig. 4.4 (a) Energy-loss spectrum of a 50-nm film of uracil on a 2.5-nm carbon substrate, recorded
with 25-keV electrons and β = 0.625 mrad; the dashed line shows the spectrum after correction
for double scattering. (b) Energy-loss function, obtained from films of two different thicknesses.
(c) Real and imaginary parts of the dielectric function, derived by Kramers-Kronig analysis.
(d) Effective number of electrons (per uracil molecule) as a function of the integration range;
the dashed curve was calculated from Eq. (4.32) and the solid curve from Eq. (4.33). Reprinted
with permission from Isaacson (1972a). Copyright 1972, American Institute of Physics

The principal value of the integral could be obtained by computing Re[1/ε(E)]
at values of E midway between the Im[−1/ε(E′)] data points (Johnson, 1975) or by
incorporating an analytical expression for the region adjacent to E = E′ (Stephens,
1981). However, the Kramers-Kronig integral can also be evaluated using Fourier
transform techniques (Johnson 1974, 1975), based on the fact that Re[1/ε(E)] − 1
and Im[−1/ε(E)] are cosine and sine transforms of the even and odd parts, p(t)
and q(t), of the time-dependent dielectric response function 1/ε(t) − δ(t). Because
a response cannot precede the cause, this function is zero for t < 0, so that (as in
Fig. 4.1)


$$p(t) = \mathrm{sgn}(t)\, q(t) \tag{4.29}$$


The Fourier procedure is therefore to compute q(t) as the sine transform of
Im[−1/ε(E)], obtain p(t) by reversing the sign of the Fourier coefficients over
one-half of their range, take the inverse cosine transform, and add 1 to obtain
Re[1/ε(E)]. It avoids the need to compute principal parts (there are no infinities
on the t-axis) and is rapid if fast Fourier transforms are used; see Appendix B.
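
A compact version of this Fourier procedure can be written with a general-purpose FFT. The
sketch below is not the Appendix B program; it assumes that Im[−1/ε] is sampled at
E = 0, ΔE, 2ΔE, ... with zero intensity at E = 0 and negligible intensity at the last channel,
and the function name is hypothetical. With the sign convention of numpy.fft the cosine
transform comes out with reversed sign, so it is subtracted from 1 here, which is equivalent to
Eq. (4.28).

```python
import numpy as np

def kk_re_inverse_eps(im_loss):
    """Re[1/eps(E)] from Im[-1/eps(E)] sampled at E = 0, dE, 2dE, ..."""
    L = np.asarray(im_loss, float)
    n = L.size
    odd = np.zeros(2 * n)                 # odd extension of Im[-1/eps] to "negative E"
    odd[1:n] = L[1:]
    odd[n + 1:] = -L[1:][::-1]
    q = -np.fft.fft(odd).imag / (2 * n)   # sine transform: odd part q(t)
    sign = np.zeros(2 * n)
    sign[1:n] = 1.0                       # t > 0
    sign[n + 1:] = -1.0                   # t < 0 (upper half of the array)
    p = sign * q                          # even part, p(t) = sgn(t) q(t)
    return 1.0 - np.fft.fft(p).real[:n]   # cosine transform, then offset by 1

# Check against the Drude model (plasmon energy 15 eV, width 2 eV, 0.1 eV/channel).
Ep, gam, dE, n = 15.0, 2.0, 0.1, 4096
E = dE * np.arange(n)
den = (E**2 - Ep**2) ** 2 + (E * gam) ** 2
im_loss = Ep**2 * E * gam / den
re_exact = 1.0 + Ep**2 * (E**2 - Ep**2) / den
re_fft = kk_re_inverse_eps(im_loss)
max_err = np.max(np.abs(re_fft - re_exact)[:2000])
```

In this example the plasmon peak spans about 20 channels, comfortably more than the minimum
sampling discussed next, and the Fourier result follows the analytical Drude curve closely.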

Johnson (1975) has shown that to avoid errors arising from the sampled nature
of Im[−1/ε(E)], the sharpest peak in this function should contain at least four
data points. If this condition is not met, the sine coefficients do not fall to zero
at the Nyquist frequency and sign inversion introduces a discontinuity in slope that
contributes high-frequency ripple to Re[1/ε(E)]; see Fig. 4.5. This ripple becomes
amplified at low values of E when ε₁ and ε₂ are computed.

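After evaluating Re[1/ε(E)], the dielectric function is obtained from

$$\varepsilon(E) = \varepsilon_1(E) + i\varepsilon_2(E) = \frac{\mathrm{Re}[1/\varepsilon(E)] + i\,\mathrm{Im}[-1/\varepsilon(E)]}{\{\mathrm{Re}[1/\varepsilon(E)]\}^2 + \{\mathrm{Im}[-1/\varepsilon(E)]\}^2} \tag{4.30}$$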


Equating the real and imaginary parts in Eq. (4.30) gives the separate functions
ε₁(E) and ε₂(E); see Fig. 4.4c. Other optical quantities can also be calculated, such
as the optical absorption coefficient:
Fig. 4.5 Re[1/ε(E)] for a free-electron gas, the FWHM of the plasmon peak being (a) four
channels and (b) two channels. The solid curve was calculated directly from Drude theory
(Section 3.3.1); square data points were derived from the Drude expression for Im[−1/ε(E)], using
the Fourier method of Kramers-Kronig analysis (Egerton and Crozier, 1988)
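
Equation (4.30) translates directly into array arithmetic, as in the following sketch. The
absorption coefficient is computed here from the standard optical relation
μ = (E/ħc)[2(ε₁² + ε₂²)^(1/2) − 2ε₁]^(1/2), which is assumed rather than taken from the text;
the function names and the test values (refractive index 1.5, extinction coefficient 0.1) are
illustrative.

```python
import numpy as np

HBAR_C_EV_NM = 197.327   # hbar * c in eV nm

def dielectric_function(re_inv_eps, im_loss):
    """Eq. (4.30): eps1 and eps2 from Re[1/eps] and Im[-1/eps]."""
    denom = re_inv_eps**2 + im_loss**2
    return re_inv_eps / denom, im_loss / denom

def absorption_coefficient(E_eV, eps1, eps2):
    """Optical absorption coefficient in nm^-1 (standard optics relation, assumed)."""
    return (E_eV / HBAR_C_EV_NM) * np.sqrt(2.0 * (np.hypot(eps1, eps2) - eps1))

# Round trip for a medium with refractive index 1.5 and extinction coefficient 0.1.
eps = (1.5 + 0.1j) ** 2
inv = 1.0 / eps
eps1, eps2 = dielectric_function(inv.real, -inv.imag)
mu = absorption_coefficient(2.0, eps1, eps2)   # at a photon energy of 2 eV
```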
4.2.4 Correction for Surface Losses


Equations (4.26), (4.27), and (4.28) assume that energy losses take place only within 
the interior of the specimen. If the specimen thickness is below about 50 nm, 
it is desirable to make allowance for surface losses. For perpendicular incidence, 
Eq. (3.80) gives the single-scattering surface-loss intensity from both surfaces (but 
neglecting coupling between them) as 
where T = m₀v²/2 and the β/(β² + θ_E²) term is usually negligible. For a clean
surface, ε_a = 1. For a metal specimen whose surfaces are covered by an oxide of
permittivity ε_a, it may be better to replace ε₁ + 1 by ε₁ + ε_a in Eq. (4.31). If the
frequency dependence of ε_a is known, Eq. (4.31) could incorporate the energy-loss
function given in Eq. (3.80).

After subtracting the estimated S_s(E) from the experimental single-scattering distribution
J¹(E), a new normalization constant K is found and ε(E) is recalculated.
The whole process is repeated if necessary until the result converges (Wehenkel,
1975). After this procedure, the value of ε₁ at small E (≈ 2 eV) should approximate
to the optical value used in applying Eq. (4.27).

Whether the iteration converges depends largely on the form of the loss spec- 
trum at low energies (<10 eV). The surface-loss correction above is valid only for a 
smooth and planar surface, perpendicular to the incident beam, and ignores retarda- 
tion effects. A favorable case is a deconvolved spectrum of a thicker metal film, 
where surface losses are weak and Cerenkov losses absent. For semiconductors
and insulators, Stöger-Pollach et al. (2008) propose taking the difference between
the original spectrum and that calculated from the Kröger formula and subtracting
this difference during the iteration. Further details are discussed in Stöger-Pollach
(2008). It is also necessary to deal with the singularity in Eq. (4.31) at E = 0, as
discussed in Section B.9. 




4.2.5 Checks on the Data 


The Bethe f-sum rule (Section 3.2.4) gives rise to the concept of an effective number 
of electrons contributing to energy losses up to a value E. From Eq. (3.35), one 
definition of the effective number of electrons per atom (or per molecule) is 
where n_a is the number of atoms (or molecules) per unit volume of the sample.
Alternatively, Eq. (3.34) can be applied to optical transitions (q ≈ 0), leading to a
second effective number:


Because of the 1/E weighting factor in the relationship between dσ/dE and df/dE,
n_eff(−Im ε⁻¹) remains less than n_eff(ε₂) at low values of E but the two numbers
converge toward the same value at higher energy loss; see Fig. 4.4d. In favorable
cases (elements and simple compounds), this plateau corresponds to a known
number of electrons per atom, providing a check on the derived values of ε₂(E)
and Im[−1/ε(E)]. In elemental carbon, for example, n_eff ≈ 4, corresponding to
excitation of all the valence electrons, for energies (~200 eV) approaching the
K-ionization threshold (Hagemann et al., 1974); see Fig. 5.8. In compounds containing
several elements, inner-shell excitation can occur before the valence electron
contribution is exhausted, so plateau values are never reached and it is possible to
tell if the derived values of n_eff are substantially too high but not whether they are
loss several times the largest inner-shell binding energy of the elements involved; 
the final saturation value should then correspond to the total number of electrons per 
atom (or molecule). 
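
Both effective numbers can be evaluated as running integrals. The sketch below assumes the
usual sum-rule form n_eff = [2ε₀m₀/(πħ²n_a)] ∫ E′ f(E′) dE′ with E in eV, where f is either
Im[−1/ε] or ε₂; this form, the function name, and the Drude test case with one electron per
atom are assumptions made for illustration.

```python
import numpy as np

EPS0 = 8.8541878128e-12    # vacuum permittivity (F/m)
M0 = 9.1093837015e-31      # electron rest mass (kg)
HBAR = 1.054571817e-34     # reduced Planck constant (J s)

def n_eff(E_eV, f, n_a):
    """Running effective electron number per atom.

    f is Im[-1/eps] or eps2 sampled at E_eV; n_a is atoms per cubic metre.
    """
    E = np.asarray(E_eV, float)
    y = E * np.asarray(f, float)
    steps = 0.5 * (y[1:] + y[:-1]) * np.diff(E)          # trapezoidal rule
    integral = np.concatenate(([0.0], np.cumsum(steps)))  # in eV^2
    return 2.0 * EPS0 * M0 / (np.pi * HBAR**2 * n_a) * integral

# Drude test: plasmon energy 15 eV, width 1 eV, one free electron per atom.
Ep, gam = 15.0, 1.0
n_a = EPS0 * M0 * Ep**2 / HBAR**2            # density that gives Ep = 15 eV
E = 0.01 * np.arange(1, 200001)              # 0.01 eV to 2000 eV
loss = Ep**2 * E * gam / ((E**2 - Ep**2) ** 2 + (E * gam) ** 2)
eps2 = Ep**2 * gam / (E * (E**2 + gam**2))
n_loss = n_eff(E, loss, n_a)
n_eps2 = n_eff(E, eps2, n_a)
```

In this model the ε₂ integral saturates at energies of order the damping width, whereas the
energy-loss integral rises only around the plasmon energy; both approach one electron per atom.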


4.3 Deconvolution of Core-Loss Data 


As noted in Section 3.7.3, plural scattering can drastically alter the observed shape 
of an inner-shell ionization edge and may need to be removed before near-edge or 
extended fine structure can be interpreted. This plural scattering involves electrons 
that undergo one or more low-loss collisions (e.g., plasmon excitation) in addition 
to causing core-level ionization. The probability of more than one core ionization 
can be neglected in transmission spectroscopy, where the sample thickness is small 
compared to the inner-shell mean free path. Because the core-loss region typically 
contains many data points, equally spaced in energy, fast-Fourier methods are a 
natural choice for spectral processing. 


4.3.1 Fourier Log Method 


The deconvolution method of Johnson and Spence (1974), described in 
Section 4.1.1, assumes only that scattering events are independent, and that the prob- 
ability of plural scattering is given by Poisson statistics. This method is therefore 
capable of removing plural scattering from anywhere within the energy-loss spectrum,
including the mixed (core + plasmon) scattering beyond an ionization edge. It
involves calculating a Fourier transform of the entire spectrum, from the zero-loss 
peak up to and beyond the ionization edge(s) of interest. To prevent truncation errors 
from affecting the SSD within the range of interest, the spectrum must be recorded 
up to an energy loss well beyond these edges or else extrapolated smoothly toward 
zero intensity at some high energy loss. 

Any discontinuities in the spectrum must be removed before calculating its trans- 
form. For example, low-loss and core-loss regions obtained by separate readouts 
from a parallel recording spectrometer must be “spliced” together. The resulting 
spectrum will often have a large dynamic range (e.g., 10⁷) but Fourier procedures
usually provide the necessary precision when applied as described in
Section 4.1.1. 

Unlike the Fourier ratio method described in Section 4.3.2, Fourier log deconvo- 
lution removes plural scattering from both the core-loss region and the preceding 
background. Because the core-loss intensity just above the ionization threshold 
arises only from single inner-shell scattering, the “jump ratio” of an edge increases 
after Fourier log processing, the increase being dramatic in the case of moderately 
thick samples; see Fig. 4.6. In this respect, the deconvolved spectra are equivalent 
to those that would be obtained using a thinner sample or a higher incident energy. 
However, the noise components arising from the plural scattering remain behind 
after deconvolution, so statistical errors of background subtraction (Section 4.4.4) 
remain much the same. Therefore Fourier log deconvolution improves the sensi- 
tivity and accuracy of elemental analysis only to the extent that systematic errors in 
background fitting may be reduced, for example, if the single-scattering background 
approximates more closely to a power-law energy dependence (Leapman and Swyt,
1981a).

In addition to increasing the fractional noise content of the pre-edge background, 
Fourier log deconvolution tends to accentuate any artifacts present in the spectrum, 
e.g., due to power supply fluctuations or nonlinearity in the intensity scale. An exam- 
ple is shown in Fig. 4.6, where splicing of the low-loss spectrum to the core-loss 
region resulted in a change in slope. Deconvolution converts this change in slope 
into a “hump” extending over tens of eV, which might be mistaken for an ionization 
edge. Although somewhat extreme, this example illustrates the need for high-quality 
data prior to deconvolution. 


4.3.2 Fourier Ratio Method 


This alternative Fourier technique involves two regions of the spectrum. One of
them, usually the low-loss region J_l(E) containing the zero-loss peak and energy
losses up to typically 100 eV, is used as a deconvolution function or “instrument
function” for the second region. The latter is typically the background-subtracted
core-loss region J_k(E), in which case the result should be an unbroadened single-scattering
core-loss intensity K¹(E), obtained on the assumption that

$$J_k(E) = K^1(E) * J_l(E) / I_0 \tag{4.36}$$

Fig. 4.6 (a) Part of the energy-loss spectrum recorded from a thick region of boron nitride
(t/λ = 1.2) using 80-keV incident electrons and β = 100 mrad. (b) Same energy region after Fourier
log deconvolution, showing an artifact generated from the change in slope at 100 eV (the position of
the original gain change). (c) Extended energy range, showing the boron and nitrogen K-ionization
edges prior to deconvolution. (d) Boron and nitrogen K-edges after Fourier log deconvolution.
From Egerton et al. (1985), with permission of the Royal Society



Instrumental broadening is present on both sides of this equation, in the functions
J_k(E) and J_l(E) but not in K¹(E). The rationale behind this assumption is that plasmon
scattering contributes to the core-loss integral in the same proportion as it
contributes to intensity in the low-loss region (including the zero-loss peak), i.e.,
I_m/I_k1 = I_p/I₀; see Fig. 4.7. Plasmon/core-loss coupling is assumed negligible
(Egerton, 1976b).

Fig. 4.7 (a) Low-loss region of an energy-loss spectrum. (b) Background-subtracted ionization
edge, whose integral I_k1 is increased by an amount I_m due to mixed (core + plasmon) scattering

Taking Fourier transforms of both sides of Eq. (4.36) gives j_k(ν) = k¹(ν) j_l(ν)/I₀,
which is inverted to give

$$k^1(\nu) = I_0\, [\, j_k(\nu) / j_l(\nu) \,] \tag{4.37}$$

Equation (4.37) shows that, in principle, the measured core-loss intensity J_k(E)
can be corrected for both plural scattering and instrumental broadening by taking
a ratio of two Fourier transforms and multiplying by the zero-loss integral I₀. As
with the Fourier log method, however, such “complete” deconvolution is problematic
because high-frequency noise components present in j_k(ν) become “amplified”
when divided by j_l(ν). As in Section 4.1.1, this noise amplification can be avoided
in several ways:


(a) Gaussian modifier. Multiplying the ratio of Fourier coefficients by a Gaussian
function exp(−π²σ²ν²) gives

$$j_k^1(\nu) = I_0 \exp(-\pi^2 \sigma^2 \nu^2)\, [\, j_k(\nu) / j_l(\nu) \,] \tag{4.38}$$

If σ = ΔE/1.665, where ΔE is the FWHM of the zero-loss peak, the inverse
transform of j_k¹(ν) will be a core-loss SSD whose energy resolution and noise
content are very similar to those of the original data. If the zero-loss peak in
J_l(E) occurs at data channel m = m₀, the inverse transform J_k¹(E) is shifted to
the left by m₀ channels, relative to J_k(E).

(b) Zero-loss modifier. Using the zero-loss peak Z(E) as the noise-limiting modification
function gives

$$j_k^1(\nu) = z(\nu)\, [\, j_k(\nu) / j_l(\nu) \,] \tag{4.39}$$

Assuming the energy resolution ΔE to be independent of energy loss, the
inverse transform J_k¹(E) will have an energy resolution identical to that of J_k(E).
If Z(E) peaks at the same channel as the zero-loss peak present in J_l(E), the
energy shift associated with method (a) is avoided, but three Fourier transforms
must be calculated rather than two.

(c) Wiener filter. If the frequency spectra of both the signal J_k(E) and its associated
noise N(E) are known approximately, it is possible to choose a noise-rejection
function g(ν) that provides a good compromise between noise and resolution
in the single-scattering distribution. The aim is to perform deconvolution only
for frequencies where the signal/noise ratio is > 1. A function with a sharp
cutoff at ν = ν₁ must be avoided, since it would introduce convolution with
sin(2πν₁E)/(2πν₁E), resulting in oscillatory artifacts adjacent to any sharp features
in the SSD. A modest increase of energy resolution is possible by using a
Wiener filter function of the form


g(v) = Liv) 7)? + 1/SNR}! (4.40) 


where SNR is an estimated signal/noise ratio in the core-loss region. 

(d) Delta function approximation. If we ignore instrumental broadening of the
inelastic data, taking the low-loss spectrum to be J_d(E), in which the zero-loss
peak Z(E) is replaced by a delta function of equal area,

$$J_k(E) \approx J_k^1(E) * J_d(E) / I_0 \tag{4.41}$$

so that

$$j_k^1(\nu) \approx I_0\, j_k(\nu) / j_d(\nu) \tag{4.42}$$

Because the delta function present in J_d(E) contains high-frequency Fourier
components, noise amplification is avoided. The procedure gives good results
provided the core-loss data J_k(E) contain no sharp peaks of width comparable
to the instrumental resolution.
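
As an illustration of option (a), the following sketch applies Fourier ratio deconvolution with
a Gaussian modifier to synthetic, noise-free data. The function name, the array length of 512
channels, and the model spectra (a zero-loss peak at channel 20 with a single plasmon peak, and
an edge with a power-law tail) are all assumptions; real data would first need the background
removal and extrapolation described in the next section.

```python
import numpy as np

def fourier_ratio_gaussian(core, low, I0, zlp_fwhm_channels):
    """Gaussian-modified Fourier ratio: I0 exp(-pi^2 sigma^2 nu^2) j_k / j_l."""
    n = len(core)
    nu = np.fft.fftfreq(n)                      # frequency in cycles per channel
    sigma = zlp_fwhm_channels / 1.665
    modifier = np.exp(-(np.pi * sigma * nu) ** 2)
    ratio = np.fft.fft(core) / np.fft.fft(low)
    return np.fft.ifft(I0 * modifier * ratio).real

# Synthetic low-loss spectrum: zero-loss peak at channel 20 plus one plasmon peak.
n = 512
ch = np.arange(n)

def peak(center, fwhm, area):
    g = np.exp(-4.0 * np.log(2.0) * ((ch - center) / fwhm) ** 2)
    return area * g / g.sum()

I0 = 1.0e5
low = peak(20, 4.0, I0) + peak(60, 12.0, 0.5 * I0)

# Unbroadened single-scattering edge K1(E): abrupt onset at channel 60, E^-3 tail.
K1 = np.zeros(n)
K1[60:] = 1000.0 * ((ch[60:] - 40) / 20.0) ** -3.0

# Measured core loss according to Eq. (4.36), as a circular convolution.
core = np.fft.ifft(np.fft.fft(K1) * np.fft.fft(low)).real / I0
ssd = fourier_ratio_gaussian(core, low, I0, zlp_fwhm_channels=4.0)
```

The recovered distribution is the original edge broadened only by the Gaussian, with its onset
back at channel 60, that is, shifted left by the zero-loss position relative to the measured edge.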


4.3.2.1 Practical Details 


Before applying Fourier ratio deconvolution as described above, the low-loss and
core-loss data must be present in computer memory arrays containing the same number
N of channels; for the FFT algorithm, N = 2ᵏ where k is an integer. These two
spectra can be from separate acquisitions, with different integration times or incident
beam intensities, but must have the same electron volt/channel. The background to
the lowest energy ionization edge is first removed, as described in Section 4.4, and 
the intensity extrapolated to zero at the high-E end of the range. This procedure 
makes the intensity the same at both ends of the array and satisfies the continuity 
requirement for a Fourier series; see Section 4.1.1. Likewise, the intensity should be 
approximately zero at both ends of the low-loss spectrum. 

If necessary, complex numbers can be avoided by computing real and imagi- 
nary Fourier coefficients (see Section 4.1.1 and Appendix B) and processing them 
according to the rules of complex division. For example, 


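$$\frac{j_k(\nu)}{j_l(\nu)} = \frac{j_{k1} + i j_{k2}}{j_{l1} + i j_{l2}} = \frac{j_{k1} j_{l1} + j_{k2} j_{l2}}{j_{l1}^2 + j_{l2}^2} + i\, \frac{j_{k2} j_{l1} - j_{k1} j_{l2}}{j_{l1}^2 + j_{l2}^2} \tag{4.43}$$
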
Fig. 4.8 (a) Carbon K-edge recorded from a thick sample of graphite using 80-keV incident electrons
and a collection semi-angle β ≈ 100 mrad. (b) Single-scattering K-loss intensity recovered
using the Fourier ratio method. (c) Single-scattering distribution obtained by Fourier log decon- 
volution, followed by background subtraction. From Egerton et al. (1985), with permission of the 
Royal Society 


In the case of noise-free data and a specimen of uniform thickness, the Fourier 
ratio method can be shown to be equivalent to Fourier log deconvolution (Swyt and 
Leapman, 1984). Applied to real data, the two methods give results which are very 
similar; see Fig. 4.8. 


4.3.2.2 Effect of a Collection Aperture 


If F_p, F_k, and F_pk are fractions of the plasmon-loss, core-loss, and double (core-loss
+ one-plasmon) scattering that pass through an angle-limiting collection
aperture, the fraction of double scattering remaining after Fourier log deconvolution
can be shown to be

$$R_{pk} = (F_{pk} - F_p F_k) / F_{pk} \tag{4.44}$$


which is analogous to Eq. (4.25) and applies to both Fourier log and Fourier ratio 
deconvolution. 

The angular distribution of double scattering can be calculated as an angular 
convolution of the core-loss and plasmon angular distributions. Taking the latter to 
be a Lorentzian function with a cutoff at θ_c, R_pk is estimated to be less than 4% for
B > 0, (Egerton and Wang, 1990). The plural scattering intensity left behind after 
deconvolution is appreciable only for small values of 6 and high-energy edges; see 
Fig. 4.9. 
Fig. 4.9 Fraction of mixed (plasmon + core-loss) scattering remaining after deconvolution, plotted
against edge energy for plasmon energies of 20 and 30 eV and three values of collection semi- 
angle. The calculations are for an incident energy of 80 keV, allowing direct comparison with the 
data points based on Monte Carlo simulations of Stephens (1980). From Egerton and Wang (1990), 
copyright Elsevier 


4.3.3 Bayesian Deconvolution 


Maximum-entropy and maximum-likelihood deconvolution (Section 4.1.3) can be 
applied to the core-loss region, taking the kernel as either the zero-loss peak or the 
entire low-loss region (extrapolated to zero at the highest energy loss, if necessary). 
In the latter case, deconvolution removes plural (core + plasmon) scattering from the 
ionization edge, as well as sharpening it. Figure 4.10(a,b) illustrates how Bayesian 
deconvolution can improve the energy resolution and dramatically increase the split- 
ting of overlapping peaks, provided the spectral intensity (counts per channel) is 
high and the shot noise correspondingly low. Using the Richardson-Lucy algorithm, 
Gloter et al. (2003) reported an increase in resolution from 0.9 eV to 0.3 eV at 
the diamond K-edge. Kimoto et al. (2003) applied the same algorithm to the boron 
K-edge in BN, sharpening the π*-peak from 0.52 to 0.35 eV (FWHM).

One advantage of these methods is their response to discontinuities in the data; 
as shown in Fig. 4.10c, artifacts are generated locally rather than spread across the 
spectrum, as with Fourier methods. In fact, raw core-loss data (without background 
subtraction or extrapolation) can be processed without generating obvious artifacts 
simply by choosing the energy range so that the spectral intensity is equal at both 
ends (Egerton et al., 2008). 




Fig. 4.10 Carbon K-edge of graphite before (solid curve) and after (dashed curve) maximum
entropy deconvolution. The spectra were recorded with an energy resolution of (a) 0.9 eV and
(b) 0.4 eV; in both cases deconvolution enhances the splitting of the σ* peak. From Overwijk and
Reefman (2000), copyright Elsevier. (c) Oxygen K-edge after Fourier ratio deconvolution (dotted)
and Richardson-Lucy deconvolution and sharpening (solid curve) with 15 iterations. The arrow
shows an artifact generated by the discontinuity in intensity at 510 and 600 eV. (d) Peak generated
by using a low-loss spectrum of Si3N4 as both the data and the kernel in RL deconvolution. From
Egerton et al. (2008), copyright Elsevier


Since core-loss data do not contain a zero-loss peak, there is no immediate way of 
judging the energy resolution after a given number of iterations. However, by using 
the same (zero-loss or low-loss) data as both data and kernel, a peak is generated 
(at E=0) whose width gives an approximate indication of the resolution after a 
given number of iterations; see Fig. 4.10d. 
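
The self-deconvolution check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the zero-loss peak is a synthetic Gaussian (sigma of 4 channels), the iteration is the plain unregularized Richardson-Lucy update with a flat starting estimate, and the helper names (convolve_same, richardson_lucy, fwhm_channels) are hypothetical, not taken from Egerton et al. (2008) or from any software package.

```python
import math

def convolve_same(signal, kernel, center):
    """Discrete convolution; kernel[center] is the zero-shift element."""
    n = len(signal)
    out = [0.0] * n
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            idx = i + j - center
            if 0 <= idx < n:
                out[idx] += s * k
    return out

def richardson_lucy(data, kernel, center, iterations):
    """Plain (unregularized) Richardson-Lucy iteration."""
    total = sum(kernel)
    psf = [k / total for k in kernel]            # kernel normalized to unit area
    psf_flipped = psf[::-1]                      # mirrored kernel for the correction step
    flipped_center = len(psf) - 1 - center
    estimate = [sum(data) / len(data)] * len(data)   # flat starting estimate (assumption)
    for _ in range(iterations):
        blurred = convolve_same(estimate, psf, center)
        ratio = [d / b if b > 0 else 0.0 for d, b in zip(data, blurred)]
        correction = convolve_same(ratio, psf_flipped, flipped_center)
        estimate = [e * c for e, c in zip(estimate, correction)]
    return estimate

def fwhm_channels(profile):
    """Crude width measure: number of channels at or above half the peak height."""
    half = max(profile) / 2
    return sum(1 for value in profile if value >= half)

# Synthetic zero-loss peak: Gaussian, sigma = 4 channels, centred on channel 30.
zero_loss = [1000.0 * math.exp(-0.5 * ((ch - 30) / 4.0) ** 2) for ch in range(61)]

# The same array serves as both data and kernel, so the result is a narrow peak
# at the zero-loss channel whose width indicates the resolution reached.
sharpened = richardson_lucy(zero_loss, zero_loss, center=30, iterations=15)

width_before = fwhm_channels(zero_loss)
width_after = fwhm_channels(sharpened)
print(width_before, width_after)
```

Repeating the call with different iteration counts gives a rough width-versus-iterations curve; with real, noisy data the achievable narrowing would be expected to saturate earlier than in this noise-free example.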


4.3.4 Other Methods 


Muto et al. (2006a) describe use of the Pixon method, based on Bayes’ theorem but 
said to avoid the global oversmoothing inherent in the maximum entropy method. It 
was applied to the two-dimensional output of the CCD camera of the Enfina spec- 
trometer, thereby correcting for the detector point-spread function and for curvature 
of the spectrum in the cross-dispersion direction. The authors demonstrate that a
LaB6 source run at low current can achieve an energy resolution of 0.6 eV at
the K-edge π* peak of h-BN, becoming 0.38 eV after deconvolution. Muto et al.
(2006b) demonstrate that the Pixon method can deal with noisy core-loss data. 

Wachtmeister and Csillag (2011) describe an iterative method (Gold, 1964) for 
increasing the energy resolution, based on matrix manipulation. By using the low-loss
spectrum as a response function, plural scattering was simultaneously removed
from core-loss data, giving results in agreement with Fourier ratio deconvolution. 

A method for removing plural scattering has been described, based on spectra 
recorded from samples of different thickness (Bradley et al., 1985) or different inci- 
dent beam energy (Gibbons et al., 1987). It uses a delta function approximation for 
the zero-loss peak, so no attempt is made to sharpen the data, and incorporates an 
expression for the angular dependence of scattering, so the method can accommo- 
date spectra recorded with an off-axis collection aperture (Bradley and Gibbons, 
1986). Reasonable results were obtained from aluminum K-edge data. 

Verbeeck and Bertoni (2009) reported results on a model-based procedure that 
used maximum entropy or maximum-likelihood fitting of simulated core-loss data 
to the expression O(E)*P(E) + B(E). Here P(E) is a low-loss function that includes
instrumental broadening and plural scattering, O(E) represents a parameterized
model of the core-loss SSD and B(E) is a parameterized background. O(E) was
taken to be a product of a smooth atomic edge profile σ(E) and an “equalization”
function f(E) representing ELNES modulations. The number of variable parame- 
ters was chosen as a compromise between the accuracy of the model (including 
energy resolution) and error bars of the parameters themselves. For the simulated 
data, f(E) was calculated using the FEFF program (Section 4.7), P(E) being a set 
of Poisson-weighted Lorentzian functions representing the low-loss spectrum, with 
B(E) a power-law background. Poisson noise was added to O(E) and P(E) to sim- 
ulate real data. The degree of fit was measured from the RMS deviation between 
the fitted O(E) and the known true value, for comparison with the RMS devia- 
tion obtained from Fourier deconvolution with a Gaussian modifier or Wiener filter. 
Differences between the methods were modest (less than a factor of 2) but RMS val- 
ues were smallest for the model-based procedure. An assumption of this procedure 
is that the spectral components involved are known and can be reliably simulated. 


4.4 Separation of Spectral Components 


An energy-loss spectrum contains contributions from various types of inelastic 
scattering, and it is often necessary to separate these components for subsequent 
study. The most common requirement is separation of an ionization edge from its 
underlying background, for the purpose of elemental or fine structure analysis. In 
this Section, we begin by discussing methods for modeling and extrapolating this 
background, together with the statistical and systematic errors involved. We then 
consider more elaborate methods that are used to identify and separate spectral 
components in a more general way. 

As discussed in Section 3.5, the spectral intensity due to any single energy-loss
process has a high-energy tail that approximates to a power-law energy dependence:
$AE^{-r}$. The coefficient A can vary widely (depending on the incident beam current,
for example) but the exponent r is generally in the range 2-6. The value of r typically
decreases with increasing specimen thickness (Fig. 3.35) because of plural scattering
contributions to the background, decreases with increasing collection semi-angle
β, and tends to increase with increasing energy loss, as in Fig. 3.40. Consequently,
values of A and r must be measured at each ionization edge. The energy dependence 
of the background can be measured over a fitting region immediately preceding 
the edge and will remain much the same over a limited energy range beyond the 
ionization threshold. 

Other functions, such as an exponential, polynomial, or log-polynomial, have 
been used for the E-dependence of the pre-edge background and are sometimes 
preferable to the power-law model. Polynomial functions can behave wildly when 
extrapolated to higher energy loss, although such behavior is avoidable by using a 
“tied” polynomial, forced to pass through a data point far beyond the edge. A sim- 
ilar technique can be employed for the power-law background, as discussed below. 
A power-law fit to the experimental background is likely to be improved if any 
“instrumental” background (e.g., due to electron scattering within the spectrometer) 
is subtracted from the spectrum prior to background modeling. 


4.4.1 Least-Squares Fitting 


A standard technique, giving good results in the majority of cases, is to match
the pre-edge background J(E) to a function F(E) whose parameters (e.g., A and r)
minimize the quantity


$$\chi^2 = \sum_i \left[ (J_i - F_i)/\sigma_i \right]^2 \qquad (4.45)$$


where i is the index of a channel within the fitting region and $\sigma_i$ represents the
statistical error (standard deviation) of the intensity in that channel. For simplicity,
$\sigma_i$ is generally assumed to be constant over the fitting region, in which case the
fitting procedure is equivalent to minimizing the mean-square deviation of J(E) from
the fitted curve. The statistical formulas required are simpler if some function of J(E)
can be fitted to a straight line: y = a + bx, for which the least-squares values of the
slope and y-intercept are given by (Bevington, 1969):


$$b = \frac{N \sum x_i y_i - \sum x_i \sum y_i}{N \sum x_i^2 - \left( \sum x_i \right)^2} \qquad (4.46)$$

$$a = \sum y_i / N - b \sum x_i / N \qquad (4.47)$$


Here, all summations are over the fitting region, containing N channels. 

In the case of the power-law function $F(E) = AE^{-r}$, linear least-squares fitting
is enabled by taking logarithms of the data coordinates. In other words, $y_i = \log(J_i)$
and $x_i = \log(E_i) = \log[(m - m_0)\,\delta E]$, where m is the absolute number of a data
channel, $m_0$ is the channel number corresponding to E = 0, and $\delta E$ is the energy-loss
increment per channel. Least-squares values of a and b are found by implementing
Eqs. (4.46) and (4.47) within the data-storage computer and the fitting parameters
are then given by r = -b and log(A) = a. As an estimate of the “goodness of fit,” the
parameter $\chi^2$ can be evaluated using Eq. (4.45), taking $\sigma_i \approx \sqrt{J_i}$ on the assumption
that electron beam shot noise is predominant. More useful is the normalized $\chi^2$
parameter $\chi_n^2 = \chi^2/(N - 2)$, which is less dependent on the number N of channels
within the fitting region. Alternatively, a correlation coefficient can be evaluated
(Bevington, 1969).
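
As a minimal sketch of the procedure just described, the following Python fragment applies Eqs. (4.46) and (4.47) to the logarithms of a synthetic pre-edge background and then evaluates the normalized chi-squared with the shot-noise assumption that the variance in each channel equals the count. The function names, the 200-280 eV window, and the noise-free test data are assumptions chosen for illustration; natural logarithms are used throughout.

```python
import math

def fit_power_law(energies, counts):
    """Straight-line fit of log(J) against log(E); returns (A, r) for J = A * E**(-r)."""
    xs = [math.log(e) for e in energies]
    ys = [math.log(j) for j in counts]
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xx = sum(x * x for x in xs)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_xx - sum_x ** 2)   # Eq. (4.46)
    a = sum_y / n - b * sum_x / n                                   # Eq. (4.47)
    return math.exp(a), -b                                          # log(A) = a, r = -b

def normalized_chi_squared(energies, counts, amplitude, exponent):
    """Eq. (4.45) with sigma_i**2 taken as J_i, divided by (N - 2)."""
    chi2 = sum((j - amplitude * e ** (-exponent)) ** 2 / j
               for e, j in zip(energies, counts))
    return chi2 / (len(counts) - 2)

# Noise-free test background: A = 1e12, r = 3, fitting window 200-280 eV.
pre_edge_energies = [200.0 + ch for ch in range(81)]
pre_edge_counts = [1.0e12 * e ** -3.0 for e in pre_edge_energies]

A_fit, r_fit = fit_power_law(pre_edge_energies, pre_edge_counts)
chi2_n = normalized_chi_squared(pre_edge_energies, pre_edge_counts, A_fit, r_fit)
print(A_fit, r_fit, chi2_n)
```

With real data the counts would carry Poisson noise, and any channel with zero counts would have to be excluded or rebinned before taking logarithms.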

Linear least-squares fitting is satisfactory for nearly all pre-edge backgrounds 
(Joy and Maher, 1981) but systematic errors can occur if the number of detected 
electrons per channel $J_i$ falls to a very low value (<10), a situation that may
arise in the case of energy-filtered images (Section 2.6). The fractional uncertainty
$\sigma_i/J_i \approx J_i^{-1/2}$ is then large and the error distribution becomes asymmetric, particularly
after taking logarithms of the data, resulting in a systematic error of about 2%
for $J_i = 10$, increasing to 20% for $J_i = 3$ (Egerton, 1980d). Trebbia (1988) used a
maximum-likelihood method to calculate the background, a procedure that avoids 
bias introduced by the nonlinear transformation in the least-squares method (Pun 
et al., 1984). 

After fitting in a pre-edge window, the background is usually extrapolated to 
higher energy loss and subtracted to yield the core-loss intensity corresponding to 
the ionization edge. In general, extrapolation involves both systematic and statistical 
errors, as discussed in Section 4.4.3. 

These errors can be reduced if the edge extends to high enough energy loss, 
such that the core-loss intensity falls to a small fraction of its threshold value. 
Extrapolation can then be replaced by interpolation, simply by using a fitting win- 
dow split into two halves: a pre-edge region and a second region at high energy loss. 
Least-squares fitting is performed over the channels in both regions, a straightfor- 
ward procedure using the Gatan DigitalMicrograph software. The fitted background 
then passes through the middle of the data in both halves of the fitting region, mak- 
ing the background-subtracted intensity approximately zero at both ends of its range. 
Sometimes this is an advantage, as when background removal precedes Fourier-ratio
deconvolution, for example, to remove plural scattering prior to fine-structure analysis.
However, it will likely lead to an underestimate of the core-loss integral $I_k$.
This systematic error can be reduced if it is assumed that the core-loss intensity has
an $AE^{-r}$ dependence (with the same exponent r as the background) for energies well
beyond the threshold. The core-loss intensity within the upper fitting window can
then be estimated and allowed for (Egerton and Malac, 2002). A program (BFIT) 
implementing this procedure is described in Appendix B and has enabled boron 
concentrations below 1% to be reliably measured (Zhu et al., 2001). 


4.4.2 Two-Area Fitting


In this simple method of background fitting, the fitting region is divided into two
segments of equal width and the power-law parameters A and r are found by measuring
the respective integrals $I_1$ and $I_2$ (see Fig. 4.11). If the background were
decreasing linearly with energy loss, each area would be given by the parallelogram
rule


$$I_1 = (E_3 - E_1)[J(E_1) + J(E_3)]/2 \qquad (4.48)$$

and similarly for $I_2$. In the case of a power-law background, it turns out to be more
accurate to replace the arithmetic average of intensities in Eq. (4.48) by a geometric
average:


$$I_1 = (E_3 - E_1)[J(E_1) J(E_3)]^{1/2} \qquad (4.49)$$


and likewise for $I_2$, so that


$$\frac{I_1}{I_2} = \frac{E_3 - E_1}{E_2 - E_3} \left( \frac{E_1}{E_2} \right)^{-r/2} \qquad (4.50)$$

If $E_3 = (E_1 + E_2)/2$, Eq. (4.50) becomes $I_1/I_2 \approx (E_1/E_2)^{-r/2}$ and

$$r \approx 2 \log(I_1/I_2) / \log(E_2/E_1) \qquad (4.51)$$


By straightforward integration of $J(E) = AE^{-r}$, Eq. (4.51) can be shown to be exact
for r = 2. More surprisingly, the formula remains remarkably accurate for higher
values of r, the systematic error in the background integral $I_b$ being typically less
than 1%, as illustrated in Table 4.1. The factor of 2 in Eq. (4.51) would be absent
for narrow and widely spaced energy windows, a situation which is unfavorable in
terms of statistical noise.


Fig. 4.11 Two-area method of background fitting. Values of A, r, and $I_b$ are obtained by
measuring the areas $I_1$ and $I_2$ under the background just preceding an ionization edge

Table 4.1 Systematic error involved in the two-area methodᵃ

r (exact)    r from Eq. (4.51)    I_b(exact)/I_b(1+2)    I_b(exact)/I_b(2)
2            2.000                1.0000                 1.0000
3            3.007                1.0021                 1.0014
4            4.019                1.0057                 1.0036
5            5.035                1.0109                 1.0042

ᵃ $J(E) = AE^{-r}$ and energies appropriate to a carbon K-ionization edge: $E_1$ = 200 eV, $E_3$ = 240 eV,
$E_2$ = 280 eV, and $E_4$ = 360 eV. The last two columns indicate the fractional error in the background
integral $I_b$, calculated using values of A obtained from Eqs. (4.52) and (4.53), respectively.


The value of A is obtained from either of the following equations:

$$A = (1 - r)(I_1 + I_2) / (E_2^{1-r} - E_1^{1-r}) \qquad (4.52)$$

$$A = (1 - r) I_2 / (E_2^{1-r} - E_3^{1-r}) \qquad (4.53)$$


Having computed A and r, the background contribution $I_b$ beneath the ionization
edge can be calculated; see Fig. 4.11. Of the two equations for A, Eq. (4.53) will usually
result in a more accurate value of $I_b$; the systematic error is less (see Table 4.1)
and, more importantly, the statistical extrapolation error (Section 4.4.3) is likely to
be smaller since increased weight is given to background channels close to the edge.
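
The two-area recipe can be checked numerically. The sketch below (Python, with hypothetical function names) integrates an exact power law with r = 3 over the energy windows quoted in the footnote of Table 4.1, recovers r from Eq. (4.51) and A from Eqs. (4.52) and (4.53), and compares the extrapolated background integral with the exact one. The amplitude 1e12 is an arbitrary assumption; the ratios do not depend on it.

```python
import math

def power_law_integral(amplitude, r, e_low, e_high):
    """Integral of amplitude * E**(-r) from e_low to e_high (valid for r != 1)."""
    return amplitude * (e_high ** (1 - r) - e_low ** (1 - r)) / (1 - r)

def two_area_fit(i1, i2, e1, e2):
    """Return r from Eq. (4.51) and A from Eqs. (4.52) and (4.53)."""
    e3 = 0.5 * (e1 + e2)                                          # equal-width windows
    r = 2 * math.log(i1 / i2) / math.log(e2 / e1)                 # Eq. (4.51)
    a_both = (1 - r) * (i1 + i2) / (e2 ** (1 - r) - e1 ** (1 - r))   # Eq. (4.52)
    a_upper = (1 - r) * i2 / (e2 ** (1 - r) - e3 ** (1 - r))         # Eq. (4.53)
    return r, a_both, a_upper

# Energies from the footnote of Table 4.1 (carbon K-edge), exact exponent r = 3.
E1, E3, E2, E4 = 200.0, 240.0, 280.0, 360.0
A_TRUE, R_TRUE = 1.0e12, 3.0

I1 = power_law_integral(A_TRUE, R_TRUE, E1, E3)
I2 = power_law_integral(A_TRUE, R_TRUE, E3, E2)
r_est, a_both, a_upper = two_area_fit(I1, I2, E1, E2)

ib_exact = power_law_integral(A_TRUE, R_TRUE, E2, E4)
ratio_both = ib_exact / power_law_integral(a_both, r_est, E2, E4)
ratio_upper = ib_exact / power_law_integral(a_upper, r_est, E2, E4)
print(r_est, ratio_both, ratio_upper)
```

The printed values should come out close to the r = 3 row of Table 4.1. In practice $I_1$ and $I_2$ would be sums over measured channels rather than analytic integrals.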

Because it involves only a single summation over $J_i$, the two-area method can be
executed very rapidly, a worthwhile consideration if the background fitting must be 
done a large number of times, as in STEM elemental mapping and spectrum imaging 
(Section 2.5.1). Using initial values of A and r derived from the two-area method, 
a ravine-search program (Bevington, 1969) has been used to provide a better fit to 
noisy data (Colliex et al., 1981). This procedure gave the variances of A and r and 
also $\chi^2$ as a test of the significance level of the fit.


4.4.3 Background-Fitting Errors 


In addition to a possible systematic error, any background fitted to noisy data has a 
statistical uncertainty. In the case of peaks superimposed on a smooth background 
(as in x-ray emission spectra, for example), the background can often be measured 
on both sides of the peak and its contribution below the peak deduced by inter- 
polation. In a core-loss spectrum, the background is usually sampled only on the 
low-energy side of the ionization edge and must be extrapolated to higher energies, 
resulting in a comparatively large statistical error in the background integral $I_b$.

In the case of linear least-squares fitting, the statistical error can be estimated 
using standard formulas (Bevington, 1969) for the variances of the parameters 
a and b in the equation y = a+ bx. To ensure that the coefficients a and b are 
statistically independent and thereby avoid the need to evaluate a covariance term, 

Fig. 4.12 (a) Schematic diagram of an ionization edge, defining the widths of the background
fitting and integration regions (Γ and Δ, respectively) and the core-loss and background
integrals ($I_k$ and $I_b$, respectively). (b) Same region of the spectrum, plotted on
logarithmic coordinates. The x-axis origin is at the center of the fitting region. One of
the data points within the fitting region is shown, together with its standard deviation.
From Egerton (1982a), copyright Elsevier


the origin of the x coordinate must be located at the center of the fitting region; see 
Fig. 4.12. To illustrate the method of calculation, we first consider the simple case of 
a linearly decreasing background, for which the background integral is of the form 
$I_b = n[a + b(m + n)/2]$, where m and n are the number of data channels in the fitting
region and the integration regions, respectively. The variance of $I_b$ can be obtained
from the general relation:


$$\mathrm{var}(I_b) = \left( \frac{\partial I_b}{\partial a} \right)^2 \mathrm{var}(a) + \left( \frac{\partial I_b}{\partial b} \right)^2 \mathrm{var}(b) \qquad (4.54)$$


Denoting the average standard deviation of intensity within a single channel of the
background-fitting region by σ, the variances of a and b are given by


$$\mathrm{var}(a) = \sigma^2/m \qquad (4.55)$$

$$\mathrm{var}(b) \approx \sigma^2 \Big/ \sum_{i=-m/2}^{m/2} i^2 \approx \sigma^2 \Big/ \int_{-m/2}^{m/2} x^2 \, dx = 12\sigma^2/m^3 \qquad (4.56)$$


Combining the previous three equations, we obtain


$$\mathrm{var}(I_b) = (\sigma^2 n^2/m)[1 + 3(1 + n/m)^2] \qquad (4.57)$$

If σ arises entirely from the counting statistics and if the range of extrapolation is
small, so that the electron intensity in the integration region is nearly equal to that
in the fitting region, $\sigma^2 \approx I_b/n$. In a typical case, these two regions have similar
widths (m ≈ n) and Eq. (4.57) gives $\mathrm{var}(I_b) \approx 13 I_b$, where $I_b$ is in units of detected
electrons.

The analysis for a power-law background is analogous to the above
except that the x-y plot now involves logarithms of the data and the origin of the
x-axis corresponds to a geometric-mean energy loss $E_m = (E_j E_k)^{1/2}$; see Fig. 4.12b.
The y-axis standard deviation is now related to the intensity J in each channel by
$\sigma \approx \ln(J + J^{1/2}) - \ln(J) \approx J^{-1/2}$ and the background integral is given by


$$I_b = \int_{E_k}^{E_n} AE^{-r} \, dE = \frac{E_m e^a}{1 + b} \left( e^{(1+b)x_n} - e^{(1+b)x_k} \right) \qquad (4.58)$$

and its variance by

$$\mathrm{var}(I_b) = I_b^2 \, \mathrm{var}(a) + [C^2 E_m^2 e^{2a}/(1 + b)^4] \, \mathrm{var}(b) \qquad (4.59)$$


where $C = \exp[(1 + b)x_n][(1 + b)x_n - 1] - \exp[(1 + b)x_k][(1 + b)x_k - 1]$,
$a = \ln(AE_m^{-r})$, $b = -r$ and the coordinates $x_k$ and $x_n$ are as defined in Fig. 4.12.

The inner-shell intensity $I_k$ is obtained by integrating the total intensity between
$x_k$ and $x_n$ (to give an integral $I_t$) and subtracting the background integral $I_b$.
Statistical errors in $I_t$ and in $I_b$ are therefore additive:


$$\mathrm{var}(I_k) = \mathrm{var}(I_t) + \mathrm{var}(I_b) = I_k + I_b + \mathrm{var}(I_b) \qquad (4.60)$$


The last term in Eq. (4.60) represents the background extrapolation error, which
usually contributes most of the uncertainty in $I_k$. By treating $I_k$ as the required signal
and $[\mathrm{var}(I_k)]^{1/2}$ as its statistical noise, the signal/noise ratio can be written as


$$\mathrm{SNR} = I_k [\mathrm{var}(I_k)]^{-1/2} = I_k/(I_k + h I_b)^{1/2} \qquad (4.61)$$


where the dimensionless parameter $h = [I_b + \mathrm{var}(I_b)]/I_b$ represents the factor
by which the background-dependent part of $\mathrm{var}(I_k)$ is increased by fitting and
extrapolation errors.

If the width of the integration region is sufficiently small, the extrapolated back- 
ground approximates to a straight line over this region and Eq. (4.57) could be used 
to estimate the value of h; for example, h = 14 if m = n. This large value illustrates
how statistical noise in the fitting region becomes “amplified” by extrapolation.
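
The amplification factor quoted above follows directly from the linear-background formulas. The Python sketch below assembles var($I_b$) from Eqs. (4.54)-(4.56), takes the per-channel variance as $I_b/n$ (the counting-statistics assumption made in the text for a short extrapolation range), and evaluates h and the signal/noise ratio of Eq. (4.61). The function names and the sample numbers are illustrative assumptions.

```python
import math

def var_background_linear(sigma_squared, m, n):
    """Variance of the extrapolated integral for a linear background, Eq. (4.57)."""
    var_a = sigma_squared / m                 # Eq. (4.55)
    var_b = 12.0 * sigma_squared / m ** 3     # Eq. (4.56)
    d_ib_da = n                               # partial derivative of I_b = n[a + b(m+n)/2]
    d_ib_db = n * (m + n) / 2.0
    return d_ib_da ** 2 * var_a + d_ib_db ** 2 * var_b   # Eq. (4.54)

def extrapolation_factor(i_b, m, n):
    """h = [I_b + var(I_b)] / I_b, taking sigma**2 as I_b / n (counting statistics)."""
    sigma_squared = i_b / n
    return (i_b + var_background_linear(sigma_squared, m, n)) / i_b

def signal_to_noise(i_k, i_b, h):
    """Eq. (4.61)."""
    return i_k / math.sqrt(i_k + h * i_b)

# Equal fitting and integration widths (m = n), then a fitting window twice as wide.
h_equal = extrapolation_factor(1.0e4, m=50, n=50)
h_wide_fit = extrapolation_factor(1.0e4, m=100, n=50)
snr_equal = signal_to_noise(1.0e3, 1.0e4, h_equal)
snr_wide_fit = signal_to_noise(1.0e3, 1.0e4, h_wide_fit)
print(h_equal, h_wide_fit, snr_equal, snr_wide_fit)
```

Doubling the fitting width at fixed integration width lowers h substantially in this linear approximation, which is the trend the text attributes to a wider background-fitting region.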

In the more general case, Eq. (4.59) and Fig. 4.13a indicate that Γ should
be comparable to or larger than Δ in order to avoid a large extrapolation error
(h >> 1). Figure 4.13b shows that as Δ increases (keeping Γ constant), the
signal/noise ratio first increases and then falls slightly. The initial rise is due to 

Fig. 4.13 Extrapolation parameter h and signal/background ratio (in arbitrary units) as a function
of (a) width Γ of the background-fitting region and (b) width Δ of the integration region. The
calculations assume power-law background («E£~") and edge («E~*) intensities. Dashed curves show
SNR for a weak ionization edge; dotted curves are for a strong edge (equal edge and background
intensities). From Egerton (1982a), copyright Elsevier


increase in the signal $I_k$; the subsequent decrease arises from the rapid increase in h
as the range of extrapolation is extended. 

Berger and Kohl (1993) analyzed how statistical and other factors influence the 
choice of instrumental parameters for elemental mapping. As always in energy- 
filtered imaging, spatial resolution is of prime importance; the effect of chromatic 
aberration (Section 2.3.2) puts further constraints on Δ and results in smaller values
(typically 20 eV) being used than those that minimize statistical and systematic
errors. Kothleitner and Hofer (1998) published contour maps showing how SNR 
varies with the width and position of the integration window, for different types of 
ionization edge. Not surprisingly, this window should start at the ionization thresh- 
old (where intensity is highest) in the case of a sharp (hydrogenic) edge but should 
be located around the broad maximum in the case of a delayed edge. 

The statistical error in $I_k$ is much reduced if background extrapolation is replaced
by interpolation, as discussed in Section 4.4.1. For linear interpolation, $I_b = an$,
$\mathrm{var}(a) = \sigma^2/m$, $\mathrm{var}(b) = 0$, $\mathrm{var}(I_b) = n^2\sigma^2/m$ and $\mathrm{SNR} = I_k/[I_k + I_b + (n/m)I_b]^{1/2}$,
giving h ≈ 2 for an equal number of fitting and integration channels (m ≈ n).
Borglund et al. (2005) have advocated spectral filtering by principal components 
analysis (PCA) as a way of reducing noise and improving background fitting. MLS 
fitting can also achieve better statistical accuracy, as discussed in the next section. 


4.4.4 Multiple Least-Squares Fitting


Because of uncertainties in background fitting (Section 4.4.3), the extrapolation 
method of isolating the core-loss intensity fails for very noisy data, for ionization 
edges that are weak relative to the background, and for edges that occur in close 
proximity. In these cases, the situation is improved by using a fitting procedure 
that involves both the background and the ionization edge(s). Multiple least squares 
(MLS) methods fit the total spectral intensity J(E) to an expression typically of the 
form 


$$F(E) = AE^{-r} + \sum_n B_n S_n(E) \qquad (4.61a)$$


The first term represents a background preceding the edge of lowest energy loss,
while the $S_n(E)$ terms represent core-loss reference spectra of the elements of interest.
They are usually recorded from external standards but Mendis et al. (2010) have
described a procedure that requires only data from the sample being analyzed. 

The coefficients $B_n$ can be found by minimizing $\sum_i (J_i - F_i)^2$ for data channels
i covering the entire region (Leapman and Swyt, 1988; Manoubi et al., 1990;
Leapman, 1992) or by maximum-likelihood estimation (Verbeeck and Van Aert, 
2004). In the latter case it is possible to calculate an unbiased estimate of the confi- 
dence limits of the fitting. Elemental analysis from least-squares fitting is discussed 
in Section 4.5.4. 
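
To make the least-squares step concrete, the following Python sketch fits Eq. (4.61a) to a synthetic spectrum containing two overlapping edges. It rests on simplifying assumptions that are not part of the cited procedures: the exponent r is held fixed (for example at a value taken from a pre-edge fit) so that the problem is linear in A and the $B_n$; all channels are weighted equally; and the reference spectra are idealized power-law edge shapes. The function names are hypothetical.

```python
def solve_linear(matrix, rhs):
    """Solve a small linear system by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda idx: abs(aug[idx][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    solution = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][k] * solution[k] for k in range(row + 1, n))
        solution[row] = (aug[row][n] - tail) / aug[row][row]
    return solution

def mls_fit(energies, counts, r, references):
    """Least-squares values [A, B_1, B_2, ...] for F(E) = A*E**(-r) + sum_n B_n*S_n(E)."""
    e0 = energies[0]
    # The background basis is scaled to unity at the first channel for conditioning.
    basis = [[(e / e0) ** (-r) for e in energies]] + [list(ref) for ref in references]
    normal = [[sum(p * q for p, q in zip(bi, bj)) for bj in basis] for bi in basis]
    rhs = [sum(p * j for p, j in zip(bi, counts)) for bi in basis]
    coeffs = solve_linear(normal, rhs)
    coeffs[0] *= e0 ** r          # convert the scaled amplitude back to A
    return coeffs

def edge_shape(energies, threshold):
    """Idealized reference edge: zero below threshold, power-law tail above it."""
    return [(e / threshold) ** -3.0 if e >= threshold else 0.0 for e in energies]

energies = [150.0 + ch for ch in range(251)]            # 150-400 eV, 1 eV per channel
reference_1 = edge_shape(energies, 200.0)
reference_2 = edge_shape(energies, 280.0)
A_TRUE, B_TRUE = 5.0e8, [40.0, 10.0]
spectrum = [A_TRUE * e ** -3.0 + B_TRUE[0] * s1 + B_TRUE[1] * s2
            for e, s1, s2 in zip(energies, reference_1, reference_2)]

fitted = mls_fit(energies, spectrum, 3.0, [reference_1, reference_2])
print(fitted)
```

A fuller treatment would also vary r (a nonlinear parameter) and weight each channel by its statistical error, or replace the least-squares criterion by a maximum-likelihood one as in the work cited above.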


4.4.5 Multivariate Statistical Analysis 


The analysis of elemental distributions in materials science specimens often 
involves recording spectrum image data, resulting in a large amount of informa- 
tion containing the ionization edges of several elements, often with overlap between 
them, as well as low-loss data. Because the recording time is limited by specimen 
drift, for example, there is often a substantial noise component. Although drift- 
correction procedures are available (e.g., Schaffer et al., 2006; Heil and Kohl, 2010), 
this situation calls for a procedure that can sort through the data, deal effectively 
with the noise, and extract the information of interest, in other words multivariate 
statistical analysis (MVA or MSA).⁴

⁴ MATLAB freeware for multivariate image analysis is available from
http://macc.mcmaster.ca/research/software/maccmia.

The most widely used multivariate method is principal component analysis
(PCA). The signal is assumed to consist of a linear combination of contributions
from individual elements or compounds and to have higher variance than the noise,
allowing the noise to be isolated from the signal and largely eliminated from the
data. To comply with the assumption of linearity, plural scattering should first be
removed by deconvolution. The spectrum $J_i$ from each image point (x,y) can then
be written (Trebbia and Bonnet, 1990) as the sum of components $X_k$, each with a
weighting $P_{ik}$:


$$J_i = \sum_k P_{ik} X_k \qquad (4.61b)$$


These components are orthogonal, meaning $X_k \cdot X_{k'} = \delta_{kk'}$ or $\int X_k(E) \, X_{k'}(E) \, dE = 0$
except for k = k’ and are therefore described as eigenspectra. Unlike the terms in 
Eq. (4.61a), they do not in general correspond to the ionization edges of particular 
elements and may not have any direct physical meaning (although for uncentered 
data the first component represents an average spectrum). 

The spectrum-image data is arranged as a two-dimensional data matrix
D((x, y), E) by combining the two spatial dimensions (x, y) and storing them in
columns of the data matrix, while the spectral information (E-dependence) is stored 
in the rows. In the PCA process, this data matrix is decomposed as (Bosman et al., 
2006): 


$$D_{(x,y),E} = S_{(x,y),n} \, L^{T}_{n,E} \qquad (4.61c)$$


where S is known as a score matrix and $L^T$ is the transpose of a loading matrix
L. Now each row of the matrix $L^T$ contains an eigenspectrum, uncorrelated with
the other rows, while each column of S gives the spatial distribution of the corresponding
eigenspectrum in the loading matrix. The individual product of each row
of the loading matrix and each column of the score matrix is called a component.
The number of components is n and is equal to the smaller of x·y and E.

Matrix decomposition is carried out by applying eigenanalysis or singular value 
decomposition to the data matrix (Joliffe, 2002; Malinowski, 2002), the singular 
values being equivalent to the square root of the eigenvalues. The relative mag- 
nitude of each eigenvalue indicates the amount of variance (information) that the 
corresponding principal component contributes to the data set. In the decomposed 
matrices, the components are ordered from high to low, according to their eigen- 
values and therefore the variance or information content. The number of useful 
components is typically much less than n, the lower-variance components repre- 
senting noise. By using only the useful (principal) components, the original data 
set can be reconstructed with the noise removed and without sacrificing spatial or 
energy resolution. 
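
The noise-filtering idea can be illustrated with a small synthetic spectrum image. The Python sketch below is a toy model only: the data are built from two made-up component spectra with random weights plus Gaussian noise, each row of the data list is one pixel (a layout chosen for convenience), the leading eigenvectors are found by power iteration with deflation rather than by a library SVD routine, and the data are left uncentered. None of these choices is taken from the cited papers.

```python
import math
import random

def matvec(matrix, vec):
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]

def leading_components(data, k, iterations=100, seed=0):
    """Leading k eigenvalues and eigenspectra of D^T D (power iteration with deflation)."""
    n_chan = len(data[0])
    cov = [[sum(row[i] * row[j] for row in data) for j in range(n_chan)]
           for i in range(n_chan)]
    rng = random.Random(seed)
    eigenvalues, components = [], []
    for _ in range(k):
        vec = [rng.gauss(0.0, 1.0) for _ in range(n_chan)]
        for _ in range(iterations):
            vec = matvec(cov, vec)
            for comp in components:          # stay orthogonal to components already found
                overlap = sum(v * c for v, c in zip(vec, comp))
                vec = [v - overlap * c for v, c in zip(vec, comp)]
            norm = math.sqrt(sum(v * v for v in vec))
            vec = [v / norm for v in vec]
        eigenvalues.append(sum(v * w for v, w in zip(vec, matvec(cov, vec))))
        components.append(vec)
    return eigenvalues, components

def reconstruct(data, components):
    """Project each pixel spectrum onto the retained eigenspectra and rebuild it."""
    rebuilt = []
    for row in data:
        scores = [sum(r * c for r, c in zip(row, comp)) for comp in components]
        rebuilt.append([sum(s * comp[j] for s, comp in zip(scores, components))
                        for j in range(len(row))])
    return rebuilt

def rms_difference(a, b):
    total = sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))
    return math.sqrt(total / (len(a) * len(a[0])))

# Toy data: 40 pixels, 30 energy channels, two underlying spectra plus noise.
rng = random.Random(1)
spec_a = [1.0 if ch >= 10 else 0.0 for ch in range(30)]              # step-like edge
spec_b = [math.exp(-((ch - 20) / 3.0) ** 2) for ch in range(30)]     # peak-like feature
clean, noisy = [], []
for _ in range(40):
    wa, wb = rng.uniform(0.5, 1.5), rng.uniform(0.0, 1.0)
    row = [wa * a + wb * b for a, b in zip(spec_a, spec_b)]
    clean.append(row)
    noisy.append([value + rng.gauss(0.0, 0.05) for value in row])

eigenvalues, components = leading_components(noisy, k=2)
denoised = reconstruct(noisy, components)
rms_noisy = rms_difference(noisy, clean)
rms_denoised = rms_difference(denoised, clean)
print(eigenvalues, rms_noisy, rms_denoised)
```

In this toy case the number of components is known in advance; with experimental data it has to be chosen, for instance from a scree plot, and shot noise would call for a Poisson-appropriate weighting before decomposition.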

Several methods have been developed to identify the information-carrying prin- 
cipal components. A common approach is to plot the logarithms of the eigenvalues 
against component number, in a so-called scree plot; see Fig. 4.14a. The variance 
decreases rapidly with component number and then exhibits a slower exponential 
decline, forming a straight line on the scree plot (which resembles geological scree 
at the base of a mountain slope). The components used for reconstruction are chosen 
as those that precede this linear portion of the curve. 

The MSA process can be illustrated by data derived from a BN test specimen, 
where Bosman et al. (2006) identified nine non-noise components. Preprocessing 
the data by dividing by a weighted average to take into account the Poisson nature 

Fig. 4.14 (a) Scree plot for the first 25 eigenvalues for spectrum image data recorded from a
specimen of BN flakes supported on a lacey carbon film, with and without a pre-PCA weighting.
(b) Score images and loading spectra for the first six principal components derived by principal
component analysis of SI data from the BN/C specimen. From Bosman et al. (2006), copyright
Elsevier. For another example of MSA application, see Lozano-Perez et al. (2009)


of the noise (Keenan and Kotula, 2004) reduced this number to six; see Fig. 4.14b. 
The first component represents the average loading in each pixel of the original 
image and displays the ionization edges involved. The second principal compo- 
nent shows an anticorrelation between BN and carbon; in the associated score 
image, carbon is bright and BN dark. The third component reveals a systematic 
artifact due to incorrect background subtraction: a power-law background was fit- 
ted in front of the boron K-edge but undersubtracted the true background at higher 
energy loss. This false signal is the main contribution in areas containing only car- 
bon, which provides a downward-sloping spectrum whereas the third component 
is upward-sloping, making the third component negative (anticorrelated with the 
carbon K-edge) above 340 eV. 

The fourth principal component of the BN data contains ELNES information and 
demonstrates an anticorrelation between π* and σ* features of the B and N K-edges.
The π* peak represents directional sp² bonds, perpendicular to the plane of the BN
flakes. The momentum transfer in this experiment was mainly perpendicular to the 
incident beam, so the fourth score image is bright where flakes lie parallel to the 
beam and dark where they are perpendicular. The fifth principal component repre- 
sents slight misalignment between the core-loss and low-loss spectrum images, the 
Fourier ratio deconvolution generating spectral artifacts in areas of strong thickness 
variation, while the sixth component represents slight energy-scale misalignment of 
the spectra. 

Independent component analysis (ICA) attempts to find components that are
mutually independent rather than orthogonal (Hyvarinen et al., 2001). The most
strongly correlated components of core-loss spectra are the backgrounds to ionization
edges but they can be largely removed by taking a derivative of the intensity
(Bonnet and Nuzillard, 2005), so there is some advantage to working with derivative 
data. 

The principles of ICA are well illustrated by a study by de la Peña et al.
(2011) on a Sn0.5Ti0.5O2 nanopowder. They first used EELSLab software (Arenal
et al., 2008) to perform principal component analysis, which is less computer intensive,
deals better with noise and avoids problems associated with over-learning
(Hyvarinen et al., 2001). Six principal components were found but as the spec- 
tra were not deconvolved to remove plural scattering, some of these components 
were suspected to arise from nonlinearity and variations in carbon contamination 
thickness. Restricting the analysis to the range above 430 eV reduced the number 
of principal components to three, but these appeared to represent a non-physical 
mixture of Ti, Sn, and O, making the distribution maps difficult to interpret in 
chemical terms; see Fig. 4.15(a-c). Kernel-independent component analysis (Bach
and Jordan, 2002) was then used to extract components that strongly resembled the
spectra of TiO2 and SnO2; see Fig. 4.15(d-f). The fact that the SnO2 component was
not visible in any of the PCA spectra illustrates the advantage of obtaining signa- 
ture spectra for the component oxides, rather than attempting to identify individual 
elemental signals (which in this case are compromised because of edge overlap). 




Fig. 4.15 (a-c) Three principal components (spectra and images) given by PCA for a Sn0.5Ti0.5O2
nanopowder specimen. (d-f) ICA components and distribution maps showing (d) the overall spectral
background, (e) rutile-phase TiO2, and (f) SnO2. From de la Peña et al. (2011), copyright
Elsevier

These results demonstrate how ICA can in certain cases transform the principal
components into easily interpretable independent components. 


4.4.6 Energy- and Spatial-Difference Techniques 


If an energy-loss spectrum is differentiated with respect to energy loss, the 
slowly varying background to an ionization edge is largely eliminated. First- or 
second-difference spectra can be obtained by digitally filtering conventional spectra 
(Zaluzec, 1985; Michel et al., 1993) or by using a spectrum-shifting technique with 
a parallel recording spectrometer (Section 2.5.5). These spectra can then be fitted 
to energy difference reference spectra, using MLS fitting without the background 
term in Eq. (4.73). Because difference spectra are highly sensitive to fine structure 
oscillations in intensity, reliable quantification may depend upon the chemical envi- 
ronment being similar in the standard and the unknown sample. This condition is 
more easily met for metals and biological specimens than for ionic and covalent 
materials (Tencé et al., 1995). 
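
The background suppression obtained by differencing can be checked numerically. The sketch below is a minimal illustration, assuming a power-law background, an idealized step-like edge at 500 eV, and a second-difference filter of the form 2S(E) - S(E - d) - S(E + d); the function name, the shift d, the sign convention, and all numerical values are assumptions for the example and are not taken from the cited work.

```python
import numpy as np

def second_difference(spectrum, shift):
    """Second-difference spectrum 2*S(E) - S(E - d) - S(E + d), where d is
    `shift` channels. Channels lacking both neighbours are dropped."""
    s = np.asarray(spectrum, dtype=float)
    return 2.0 * s[shift:-shift] - s[:-2 * shift] - s[2 * shift:]

energy = np.arange(300.0, 701.0, 1.0)                    # eV, 1 eV per channel
background = 1.0e10 * energy ** -3.0                     # assumed power-law background
edge = np.where(energy >= 500.0, 100.0 * (energy / 500.0) ** -3.0, 0.0)

d2_background = second_difference(background, shift=5)
d2_total = second_difference(background + edge, shift=5)

# Ratio of the largest background residue to the largest edge feature.
suppression = np.max(np.abs(d2_background)) / np.max(np.abs(d2_total))
print(suppression)
```

A small value of `suppression` indicates that the slowly varying background contributes little to the difference spectrum compared with the abrupt edge onset.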

Another way of dealing with the pre-edge background is to record spectra from 
a region of interest in the specimen (such as an interface or grain boundary) and 
from a nearby “matrix” region. The matrix spectrum (scaled if necessary to allow 
for changes in thickness or diffracting conditions) is subtracted from the original 
data, yielding a spatial-difference spectrum that represents change in the ionization 
edge intensity between the two locations. Because the pre-edge background is 
largely eliminated, simple integration can be used for elemental quantification; see 
Fig. 4.16. 

Fig. 4.16 Spatial difference and second energy difference spectra recorded from a nitrogen-containing
voidite in diamond. The shape of the K-edge (white-line peak followed by a broad
continuum) is consistent with molecular nitrogen. From Müllejans and Bruley (1994), copyright
Elsevier

One advantage of the spatial-difference method is that, provided the changes
in composition are small and the crystal structure is similar in the two locations,
systematic variations in background intensity (due, for example, to extended fine
structures from a preceding edge) are eliminated in the subtraction. Müllejans
and Bruley (1994) have discussed other advantages of this technique in terms of
signal/noise ratio.
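
The spatial-difference subtraction can be sketched as follows. This is an illustration only: the choice of scaling the matrix spectrum by a least-squares match over a pre-edge window is an assumption of the example (the text requires only that the matrix spectrum be scaled if necessary), and the spectra are synthetic.

```python
import numpy as np

def spatial_difference(interface, matrix, energy, pre_edge):
    """Subtract a scaled matrix spectrum from an interface spectrum.
    The scale factor is found by least squares over the pre-edge window
    pre_edge = (E_start, E_end); this choice is an assumption of the sketch."""
    interface = np.asarray(interface, dtype=float)
    matrix = np.asarray(matrix, dtype=float)
    mask = (energy >= pre_edge[0]) & (energy < pre_edge[1])
    scale = np.dot(interface[mask], matrix[mask]) / np.dot(matrix[mask], matrix[mask])
    return interface - scale * matrix, scale

# Synthetic example (all numbers hypothetical).
energy = np.linspace(350.0, 450.0, 201)                  # eV
matrix = 5.0e9 * energy ** -3.0                          # spectrum from the matrix
edge = np.where(energy >= 400.0, 20.0, 0.0)              # extra edge at the boundary
interface = 0.8 * matrix + edge                          # thinner region plus edge

difference, scale = spatial_difference(interface, matrix, energy, (350.0, 395.0))
edge_integral = difference[energy >= 400.0].sum()        # simple integration
```

In this idealized case the difference spectrum is flat before the edge, so the edge signal can be integrated without a separate background fit.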

A similar spatial-difference procedure can be useful for fine structure stud- 
ies (Bruley and Batson, 1989). However, Muller (1999) has argued that bonding 
changes at an interface can alter the width of the valence band, producing a change 
in local potential and a core-level shift, particularly in 3d transition metals where 
the valence states penetrate the atomic core. The core-level shift adds to the differ- 
ence spectrum a first-derivative of the edge, which could be mistaken for a white 
line. The boundary should be sampled with a sufficiently small probe, so that the 
spectrum contains as little intensity as possible from the surrounding volume. 


4.5 Elemental Quantification 


Because inner-shell binding energies are separated by tens or hundreds of eV, while 
chemical shifts (Section 3.7.4) amount to only a few eV, the ionization edges in 
an energy-loss spectrum can be used for elemental analysis. The simplest way of 
making this analysis quantitative is to integrate the core-loss intensity over an appro- 
priate energy window, making allowance for the noncharacteristic background. Such 
a procedure is preferable to measuring the height of the edge, which is sensitive to 
the near-edge fine structure that depends on the structural and chemical environment 
of the ionized atom (Section 3.8). When the element is present in low concentration, 
it is difficult to obtain sufficient accuracy in the background extrapolation, so a more 
effective procedure is to fit the experimental spectrum in the core-loss region to the 
sum of a background and reference edges, as discussed in Section 4.4.4. In either
case, ionization cross sections for each edge are required. 


4.5.1 Integration Method 


Incident electrons can undergo elastic, low-loss (plasmon), and core-loss scatter- 
ing, making their angular and energy distributions quite complicated. By making 
approximations, however, we can obtain simple formulas that are suitable for routine 
quantitative analysis. 

Assume initially that inner-shell excitation is the only form of scattering in the
specimen. From Eq. (3.94), the integrated intensity of single scattering from shell k
of a selected element, characterized by a mean free path $\lambda_k$ and a scattering
cross section $\sigma_k$, would be given by

$$I_k^1 = I_0 (t/\lambda_k) = N I_0 \sigma_k \tag{4.62}$$

$I_0$ represents the unscattered (zero-loss) intensity and N is the areal density (atoms
per unit area) of the element, equal to the product of its concentration and the
specimen thickness. If we record the scattering only up to an angle $\beta$ and integrate
its intensity over a limited energy range $\Delta$, the core-loss integral is

$$I_k^1(\beta, \Delta) = N I_0 \sigma_k(\beta, \Delta) \tag{4.63}$$

where $\sigma_k(\beta, \Delta)$ is a “partial” cross section for energy losses within a range
$\Delta$ of the ionization threshold and for scattering angles up to $\beta$, obtainable
from experiment or calculation (Section 4.5.2).

The effect of elastic scattering is to cause a certain fraction of the electrons to
be intercepted by the angle-limiting aperture. To a first approximation, this fraction
is the same for electrons that cause inner-shell excitation and those that do not, in
which case $I_k^1(\beta, \Delta)$ and $I_0$ are reduced by the same factor. Therefore the
core-loss integral becomes

$$I_k^1(\beta, \Delta) \approx N I_0(\beta) \sigma_k(\beta, \Delta) \tag{4.64}$$

where $I_0(\beta)$ is the actual (observed) zero-loss intensity. Equation (4.64) applies to a
core-loss edge from which plural (core-loss + plasmon) scattering has been removed
by deconvolution.

If we now include valence electron (plasmon) excitation as a contribution to the 
spectrum, its effect is to redistribute intensity toward higher energy loss, away from 
the zero-loss peak and away from the ionization threshold. Not all of this scattering 
falls within the core-loss integration window, but to a first approximation the fraction 
that is included will be the same as the fraction that falls within an energy window 
of equal width in the low-loss region. If so, the core-loss integral (including plural 
scattering) is given by 


$$I_k(\beta, \Delta) \approx N I(\beta, \Delta) \sigma_k(\beta, \Delta) \tag{4.65}$$


where $I(\beta, \Delta)$ is the low-loss intensity integrated up to an energy loss $\Delta$; see
Fig. 4.17.

Fig. 4.17 Energy-loss spectrum showing the low-loss region and (with change in intensity scale)
a single ionization edge. The cross-hatched area represents electrons that have undergone both
core-loss and low-loss scattering

Although Eqs. (4.64) and (4.65) allow measurement of the absolute areal density
N of a given element, an atomic ratio of two elements (a and b) is more commonly
required. Provided the same integration window $\Delta$ is used for both edges, Eq. (4.65)
gives

$$\frac{N_a}{N_b} = \frac{I_{ka}(\beta, \Delta)}{I_{jb}(\beta, \Delta)} \, \frac{\sigma_{jb}(\beta, \Delta)}{\sigma_{ka}(\beta, \Delta)} \tag{4.66}$$

The shell index can be different for the two edges ($j \neq k$); K-edges are suitable for
very light elements (Z < 15) and L- or M-edges for elements of higher atomic number.
However, our approximate treatment of plural scattering will be more accurate if
the two edges have similar shape. If plural scattering is removed from the spectrum
by deconvolution, Eq. (4.64) leads to

$$\frac{N_a}{N_b} = \frac{I_{ka}^1(\beta, \Delta_a)}{I_{jb}^1(\beta, \Delta_b)} \, \frac{\sigma_{jb}(\beta, \Delta_b)}{\sigma_{ka}(\beta, \Delta_a)} \tag{4.67}$$


A different energy window ($\Delta_a \neq \Delta_b$) can now be used for each edge, larger values
being more suitable at higher energy loss where the spectrum is noisier and the 
edges representing different elements are spaced further apart. 
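
As a worked illustration of Eqs. (4.65) to (4.67), the following sketch evaluates an areal density and an atomic ratio from integrated intensities and partial cross sections. The function names and all numerical inputs are hypothetical; in practice the cross sections would come from calculation or from standards (Section 4.5.2).

```python
def areal_density(core_integral, low_loss_integral, partial_cross_section):
    """Eq. (4.65): N = I_k(beta, Delta) / [I(beta, Delta) * sigma_k(beta, Delta)].
    With the cross section in nm^2, N is returned in atoms per nm^2."""
    return core_integral / (low_loss_integral * partial_cross_section)

def atomic_ratio(i_a, i_b, sigma_a, sigma_b):
    """Eq. (4.66) or (4.67): N_a/N_b = (I_a / I_b) * (sigma_b / sigma_a)."""
    return (i_a / i_b) * (sigma_b / sigma_a)

# Made-up numbers, for illustration only.
n_a = areal_density(core_integral=2.0e5,          # counts in the core-loss window
                    low_loss_integral=1.0e8,      # counts in the low-loss window
                    partial_cross_section=1.0e-6) # nm^2
ratio = atomic_ratio(i_a=3.0e4, i_b=1.0e4, sigma_a=2.0e-6, sigma_b=1.0e-6)
print(n_a, ratio)
```

Note that the low-loss (or zero-loss) intensity cancels in the ratio, which is one reason why elemental ratios are less demanding to measure than absolute areal densities.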

Several authors have tested the accuracy of these equations for thin specimens. 
Approximating the low-loss spectrum by sharp peaks at multiples of a plasmon
energy $E_p$, Stephens (1980) concluded that the approximate treatment of plural
scattering inherent in Eq. (4.65) will lead to an error in N of between 3 and 10%
(dependent on $E_k$) for $t/\lambda_p = 0.5$ and $\Delta/E_p = 5$. The error would be some fraction
of this when evaluating elemental ratios. Systematic error arising from the angular 
approximation inherent in Eq. (4.64) was estimated to be of the order of 1% for a 
20-nm amorphous carbon film but considerably larger for polycrystalline or single- 
crystal specimens if a strong diffraction ring (or spot) occurs just inside or outside 
the collection aperture (Egerton, 1978a). 

As specimen thickness increases, the higher probability of elastic and plural 
scattering causes Eqs. (4.64), (4.65), (4.66), and (4.67) to become less accurate. 
Elemental ratios given by Eq. (4.66) or (4.67) were found to change when $t/\lambda$
exceeded approximately 0.5 (Zaluzec, 1983). This variation was attributed to the 
effect of elastic scattering and has been modeled for amorphous materials of known 
composition, using an elastic scattering angular distribution given by the Lenz 
model (Cheng and Egerton, 1993; Su et al., 1995). Correction factors can be evaluated
for specimens of unknown composition, based on additional measurements of
the low-loss spectrum at several collection angles (Wong and Egerton, 1995). Such
correction becomes significant at specimen thicknesses above 100 nm, particularly
for higher edge energy and small collection angle; see Fig. 4.18.

Fig. 4.18 Factor by which the areal density obtained from Eq. (4.64) should be divided to correct
for elastic scattering, in the case of (a) the oxygen K-edge and (b) the silicon K-edge in amorphous
silicon dioxide. From Cheng and Egerton (1993), copyright Elsevier

Precise correction for elastic scattering in crystalline specimens is a difficult task; 
it would require the measurement of intensity in the diffraction plane or a knowledge 
of the crystal structure, orientation, and thickness of the specimen. The situation in 
single-crystal specimens is further complicated by the existence of channeling and 
blocking effects; see Sections 3.1.4 and 5.6. 


4.5.2 Calculation of Partial Cross Sections 


If the core-loss intensity is integrated over an energy window $\Delta$ that is wide enough
to include most of the fine structure oscillations, the corresponding cross section
$\sigma_k(\beta, \Delta)$ should be little affected by the chemical environment of the excited atom,
and can therefore be calculated using an atomic model. Weng and Rez (1988)
estimated that atomic cross sections are accurate to within 5% for $\Delta > 20$ eV.

The simplest atomic model is based on the hydrogenic approximation, for
which the generalized oscillator strength (GOS) is available in analytic form; see
Section 3.4.1. Computation is therefore rapid and requires only the atomic number
Z, edge energy $E_k$, integration range $\Delta$, angular range $\beta$, and incident electron
energy $E_0$ as inputs. Computer programs for the calculation of K- and L-shell cross
sections are described in Appendix B.

More sophisticated procedures, such as the Hartree-Slater method, involve more
extensive computation and a greater knowledge of atomic properties. However, the
resulting GOS can be parameterized as a function of the energy and wave number
q and the resulting values integrated according to Eqs. (3.151) and (3.157) to yield
a cross section. Parameterization can also take account of EELS experiments and
(for small q) x-ray absorption measurements. A program giving K-, L-, M-, N-, and
O-shell cross sections, valid for small $\beta$, is described in Appendix B.

A completely experimental approach to quantification is also possible, the sensi- 
tivity factor for a given edge being obtained by measurements on standards (Malis 
and Titchmarsh, 1986; Hofer, 1987; Hofer et al., 1988). If these measurements are 
made in the same microscope and under the same experimental conditions as used 
for the unknown specimen, the procedure should be relatively insensitive to the 
chromatic aberration of prespectrometer lenses (Section 2.3.3) and imperfect knowl- 
edge of the collection angle and incident electron energy. It is analogous to the 
k-factor procedure used in thin-film EDX microanalysis. An experimentally determined
k-factor can be converted to a dipole oscillator strength $f(\Delta)$ that depends only
on the integration range $\Delta$, allowing partial cross sections to be calculated for any
incident electron energy and collection angle within the dipole region. Alternatively,
the generalized oscillator strengths obtained from Hartree-Slater cross sections can
be parameterized (as a function of energy loss and scattering angle) and used to
calculate $\sigma_k(\beta, \Delta)$ for a wide range of Z, $E_0$, $\Delta$, and $\beta$, as in the Gatan DM software.


4.5.3 Correction for Incident Beam Convergence 


Equations (4.64), (4.65), (4.66), and (4.67) assume that the angular spread of the
incident beam is small in comparison with the collection semi-angle $\beta$, a condition
that applies to broad-beam TEM illumination but is easily violated if the incident
electrons are focused into a fine probe of large convergence semi-angle $\alpha$. For
such a probe, the angular distribution of the core-loss intensity $dI_k/d\Omega$ can be
calculated as a vector convolution of the incident electron intensity $dI/d\Omega$ and the
inner-shell scattering $d\sigma_k/d\Omega$. Taking the latter to be a Lorentzian function of width
$\theta_E = E/(\gamma m_0 v^2)$, with $E \approx E_k + \Delta/2$, we obtain

$$\frac{dI_k}{d\Omega} \propto \int_{\phi=0}^{2\pi} \int_{\theta_0=0}^{\alpha} \frac{dI}{d\Omega} \, \frac{\theta_0 \, d\theta_0 \, d\phi}{\theta_k^2 + \theta_E^2} \tag{4.68}$$

where $\theta_k^2 = \theta^2 + \theta_0^2 - 2\theta\theta_0 \cos\phi$; see Fig. 4.19a. The core-loss intensity passing
through a collection aperture of semi-angle $\beta$ is then

$$I_k(\alpha, \beta, \theta_E) = \int_0^{\beta} \frac{dI_k}{d\Omega} \, 2\pi\theta \, d\theta \tag{4.69}$$

Fig. 4.19 (a) Calculation of area $\theta_0 \, d\theta_0 \, d\phi$ [...] (b) Convergence-correction
factors $F_1$ and $F_2$, plotted as a function of the convergence semi-angle $\alpha$ of the
incident beam, for different values of the characteristic scattering [...] (1990), copyright
Elsevier

If the incident intensity per unit solid angle remains constant up to a cutoff
angle $\alpha$, the double integral of Eq. (4.68) can be solved analytically (Craven
et al., 1981), giving

$$\frac{dI_k}{d\Omega} \propto \ln\left[\frac{\psi^2 + (\psi^4 + 4\theta^2\theta_E^2)^{1/2}}{2\theta_E^2}\right] \tag{4.70}$$

where $\psi^2 = \alpha^2 + \theta_E^2 - \theta^2$. Combining the previous three equations yields

$$F_1 = \frac{I_k(\alpha, \beta, \Delta)}{I_k(0, \beta, \Delta)} = \frac{2/\alpha^2}{\ln[1 + (\beta/\theta_E)^2]} \int_0^{\beta} \ln\left[\frac{\psi^2 + (\psi^4 + 4\theta^2\theta_E^2)^{1/2}}{2\theta_E^2}\right] \theta \, d\theta \tag{4.71}$$


$F_1$ is a factor (<1) representing reduction in the measured core-loss intensity due to
incident beam convergence. Scheinfein and Isaacson (1984) showed that the integral
in Eq. (4.71) can be expressed analytically, and their expression is used to evaluate
$F_1$ in the program CONCOR2 listed in Appendix B. Because $F_1$ depends to some
degree on $\theta_E$ (see Fig. 4.19), the convergence correction is different for each
ionization edge. When using Eq. (4.66) or (4.67) to obtain an elemental ratio, the effect of
beam convergence is included by multiplying the right-hand side by $F_{1b}/F_{1a}$.

To obtain a correction factor for absolute quantification, we need to consider the
effect of beam convergence on the low-loss intensity. Since $\theta_E/\beta \ll 1$ for valence
electron scattering, the corresponding factor $F_1$ is close to 1 provided $\alpha < \beta$ (see
Fig. 4.19). If $\alpha > \beta$, the recorded low-loss intensity is approximately proportional
to the area of the convergent-beam disk that falls within the collection aperture,
so Eq. (4.65) becomes

$$I_k(\alpha, \beta, \Delta) \approx F_2 N \sigma_k(\beta, \Delta) I(\alpha, \beta, \Delta) \tag{4.72}$$

where $F_2 \approx F_1$ for $\alpha \leq \beta$ and $F_2 \approx (\alpha/\beta)^2 F_1$ for $\alpha \geq \beta$; see Fig. 4.19b.

Alternatively, the effect of incident beam convergence can be expressed in terms
of an effective collection angle $\beta^*$ that differs from $\beta$ by an amount dependent on
$\alpha$ and the edge energy. To allow for the possibility of absolute quantification (or
thickness measurement by the log-ratio method, Section 5.1.1), we should take

$$\sigma_k(\alpha, \beta, \Delta) = F_2 \sigma_k(\beta, \Delta) = \sigma_k(\beta^*, \Delta), \tag{4.73}$$

in which case $\beta^* > \beta$ for $\alpha > \beta$.

Kohl (1985) has pointed out that incident beam convergence can be included in an
effective cross section $\sigma_k(\alpha, \beta, \Delta)$, which could incorporate a non-Lorentzian angular
dependence at high angles, although for large collection angle the convergence
correction is small anyway. Because the angular dependence of the incident beam
intensity $dI/d\Omega$ is usually unknown, and almost certainly deviates from a rectangular
function, any convergence correction is likely to be approximate.
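
The convergence factors can be estimated by direct numerical integration of Eq. (4.71). The sketch below is not the CONCOR2 program: it uses a simple trapezoidal rule rather than the analytic expression of Scheinfein and Isaacson, it assumes uniform illumination up to the semi-angle alpha and a Lorentzian core-loss angular distribution, and it takes the area-ratio form of $F_2$ for alpha greater than beta. The function names and the example angles are hypothetical.

```python
import numpy as np

def _trapezoid(y, x):
    """Trapezoidal-rule integral of samples y over grid x."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def f1(alpha, beta, theta_e, n=20001):
    """Convergence factor F1 of Eq. (4.71), by numerical integration.
    All angles must be in the same unit (e.g. mrad)."""
    theta = np.linspace(0.0, beta, n)
    psi2 = alpha**2 + theta_e**2 - theta**2
    log_term = np.log((psi2 + np.sqrt(psi2**2 + 4.0 * theta**2 * theta_e**2))
                      / (2.0 * theta_e**2))
    integral = _trapezoid(log_term * theta, theta)
    return (2.0 / alpha**2) * integral / np.log(1.0 + (beta / theta_e) ** 2)

def f2(alpha, beta, theta_e):
    """Factor F2 of Eq. (4.72): F1 if alpha <= beta, else (alpha/beta)^2 * F1
    (the squared area ratio is an assumption of this sketch)."""
    factor = f1(alpha, beta, theta_e)
    return factor if alpha <= beta else (alpha / beta) ** 2 * factor

# Hypothetical angles in mrad.
for alpha in (0.01, 5.0, 10.0):
    print(alpha, f1(alpha, beta=10.0, theta_e=1.0))
```

As the convergence semi-angle tends to zero the computed $F_1$ should approach unity, which provides a simple check on the numerical integration.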


4.5.4 Quantification from MLS Fitting 


The coefficients $B_n$ obtained from Eq. (4.61a) by multiple least-squares fitting can
be used to derive atomic ratios of the elements involved. For ultrathin specimens,
or if deconvolution has removed plural scattering, the reference spectra $S_n(E)$ can
be calculated differential cross sections (Steele et al., 1985), in which case each
coefficient $B_n$ is the product of the zero-loss intensity and the areal density of the
appropriate element.

More usually, each $S_n(E)$ is measured on an arbitrary intensity scale from a standard
specimen containing the appropriate element, in which case the atomic ratio of
any two elements (a and b) is obtained from

$$\frac{N_a}{N_b} = \frac{B_a}{B_b} \, \frac{I_{ka}(\beta, \Delta)}{I_{kb}(\beta, \Delta)} \, \frac{\sigma_{kb}(\beta, \Delta)}{\sigma_{ka}(\beta, \Delta)} \tag{4.74}$$

where $I_{ka}(\beta, \Delta)$ and $I_{kb}(\beta, \Delta)$ are integrals (over some convenient integration range
$\Delta$) of the core-loss spectra of the appropriate standards, $\sigma_{ka}(\beta, \Delta)$ and $\sigma_{kb}(\beta, \Delta)$
being the partial cross sections evaluated for the appropriate collection angle $\beta$,
allowing for incident beam convergence if necessary (Section 4.5.3).
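
The route from fitted coefficients to an atomic ratio can be sketched as follows. This is a simplified illustration: the background term of the MLS fit is assumed to have been removed already, the reference spectra are synthetic power-law edges, and the function names, edge energies, and cross-section values are hypothetical.

```python
import numpy as np

def mls_coefficients(spectrum, references):
    """Least-squares coefficients B_n such that spectrum ~ sum_n B_n * S_n(E).
    `references` has shape (n_refs, n_channels); no background term is fitted
    (a simplification of this sketch)."""
    design = np.asarray(references, dtype=float).T
    coeffs, *_ = np.linalg.lstsq(design, np.asarray(spectrum, dtype=float), rcond=None)
    return coeffs

def ratio_from_mls(b_a, b_b, i_ka, i_kb, sigma_ka, sigma_kb):
    """Eq. (4.74): N_a/N_b = (B_a/B_b) * (I_ka/I_kb) * (sigma_kb/sigma_ka)."""
    return (b_a / b_b) * (i_ka / i_kb) * (sigma_kb / sigma_ka)

# Synthetic reference edges on an arbitrary intensity scale.
energy = np.linspace(400.0, 700.0, 301)                  # eV
ref_a = np.where(energy >= 450.0, (energy / 450.0) ** -3.0, 0.0)
ref_b = np.where(energy >= 550.0, (energy / 550.0) ** -3.0, 0.0)
spectrum = 3.0 * ref_a + 2.0 * ref_b                     # "unknown" specimen

b_a, b_b = mls_coefficients(spectrum, np.vstack([ref_a, ref_b]))

# Integrals of the standards over a 50 eV window above each threshold.
i_ka = ref_a[(energy >= 450.0) & (energy < 500.0)].sum()
i_kb = ref_b[(energy >= 550.0) & (energy < 600.0)].sum()
ratio = ratio_from_mls(b_a, b_b, i_ka, i_kb, sigma_ka=2.0e-6, sigma_kb=1.0e-6)
```

With real data, the standards' integrals and the partial cross sections must refer to the same integration range and collection angle.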

If the analyzed specimen is appreciably thicker than the standard, the reference 
edge may need to be convolved with the low-loss region of the analyzed speci- 
men, to make allowance for plural scattering. If the region just above the ionization 
edge contains prominent fine structure that is sensitive to the chemical environment 
of each element (Section 3.8), this region may have to be excluded from the fit- 
ting, especially if the chemical environments of the specimens are very different. 
Alternatively, fitting to an atomic model in the near-edge region might yield valu- 
able density of states information (Verbeeck et al., 2006). For good accuracy or 
weak edges, the importance of allowing for correlated (fixed pattern) noise of the 
electron detector has been emphasized (Verbeeck and Bertoni, 2008). Atomic-ratio 
accuracies better than 3% and precisions better than 10% have been obtained using 
test compounds (Bertoni and Verbeeck, 2008). 

Riegler and Kothleitner (2010) have analyzed chromium concentrations down to 
0.1% by using MLS fitting and have derived a formula for the minimum atomic 
fraction detectable using this procedure; see Section 5.5.4. 


4.6 Analysis of Extended Energy-Loss Fine Structure 


As discussed in Section 3.9, the EXELFS modulations that extend over some hun- 
dreds of eV beyond an ionization edge can be analyzed to provide the distances 
of near-neighbor atoms from an atom of known chemical species in a TEM spec- 
imen. In favorable circumstances, coordination numbers, bond angles, and degree 
of atomic disorder are also measurable. This information is of particular value in 
the case of multielement amorphous materials, where diffraction techniques cannot 
distinguish the elastic scattering from different elements. 


4.6.1 Fourier Transform Method 


Following the original EXAFS procedure (Sayers et al., 1971), the radial distribution 
function (RDF) is obtained as a Fourier transform of the experimental EXELFS data 
(Kincaid et al., 1978; Johnson et al., 1981a; Leapman et al., 1981; Stephens and 
Brown, 1981; Bourdillon et al., 1984). The essential steps involved are as follows. 


4.6.1.1 Background Subtraction and Deconvolution 


Unless the specimen is very thin (<10 nm, for 100-keV electrons), plural scattering 
beyond the edge is first removed by deconvolution. If the Fourier ratio technique is 
used (Section 4.3.2), the pre-edge background is subtracted prior to deconvolution; 
if a Fourier log method is employed (Section 4.3.1), the background is removed
after deconvolution to yield the single-scattering core-loss spectrum $J_k^1$.

EXELFS analysis is more straightforward if the data are recorded from a K-edge,
but for atomic numbers greater than 15 the K-loss signal is weak and therefore noisy,
so the L-edge may have to be used (Okamoto et al., 1991). In the case of transition
metals, an $L_1$ edge occurs within the energy range covered by the $L_{23}$ EXELFS and
must be removed from the experimental data, for example, by subtracting a suitably
chosen fraction of the intensity at energies above the $L_1$ threshold (Leapman et al.,
1981). For transition elements beyond Ti (Z = 22), the $L_2$-$L_3$ splitting exceeds
5 eV, resulting in a “smearing” of the EXELFS, but this effect can be eliminated by
deconvolving $J_k^1$ with a pair of delta functions separated by the appropriate energy
and weighted by a suitable ratio (Leapman et al., 1982). This deconvolution can
be done by division of Fourier coefficients, before, during, or after the removal of
plural scattering.


4.6.1.2 Isolation of the Oscillatory Component 


The oscillatory part $\chi(E)$ of the core-loss intensity is obtained by subtracting from
$J_k^1(E)$ a smoothly decaying function A(E), representing the single-atom intensity
profile. In general, A(E) is not available experimentally and cannot be calculated
with sufficient accuracy, so it is obtained empirically by fitting a smooth function
through $J_k^1$, as in Fig. 4.20. This function should correctly follow the overall trend of
the data but not the EXELFS modulations themselves, otherwise false structure will
appear in the RDF at small values of radius r. An odd-order polynomial (Leapman,
1982) and a cubic spline (Johnson et al., 1981a) have been used. A power-law
function may also be suitable if the exponent is allowed to vary somewhat with
energy loss (Stephens and Brown, 1981).

Fig. 4.20 (1) Oxygen K-edge of sapphire, recorded using 100-keV incident electrons and collection
semi-angle $\beta$ = 16 mrad. Also shown are (2) the extrapolated background intensity, (3) the
core-loss intensity after background subtraction, and (4) a smooth polynomial function A(E) fitted
through the core-loss intensity. (A. J. Bourdillon, personal communication.)

Fig. 4.21 (a) $\chi(E)$ and (b) $k^n \chi(k)$, obtained from the data shown in Fig. 4.20

The difference spectrum is normalized by division with A(E) to give 


$$\chi(E) = [J_k^1(E) - A(E)]/A(E) \tag{4.75}$$


as shown in Fig. 4.21a. Defining $\chi(E)$ as a ratio of intensities makes it unnecessary
to divide by an angular correction function (Section 4.2).
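
The isolation of the oscillatory component in Eq. (4.75) can be sketched with a least-squares polynomial for A(E). This is one possible choice only; the polynomial degree, the function name, and the synthetic data (a power-law profile with a 5% sinusoidal modulation) are assumptions made for the example.

```python
import numpy as np
from numpy.polynomial import Polynomial

def oscillatory_part(energy, core_loss, degree=5):
    """Eq. (4.75): chi(E) = [J(E) - A(E)] / A(E), with A(E) taken as a
    least-squares polynomial of the given degree through the core-loss data."""
    smooth = Polynomial.fit(energy, core_loss, degree)(energy)
    return (core_loss - smooth) / smooth, smooth

# Synthetic single-scattering core-loss intensity (hypothetical values).
energy = np.linspace(560.0, 900.0, 341)                  # eV
smooth_true = 1000.0 * (energy / 560.0) ** -3.0          # single-atom profile
core_loss = smooth_true * (1.0 + 0.05 * np.sin(0.1 * energy))

chi, fitted = oscillatory_part(energy, core_loss)
```

If the polynomial degree is made too high, the fitted function begins to follow the modulations themselves, which is the source of the false low-r structure mentioned in the text.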


4.6.1.3 Scale Conversion 


The energy scale of $\chi(E)$ is converted to one of wave number k of the ejected electron
using Eq. (3.165). If energies are measured in eV and k in nm$^{-1}$, the formula is

$$k = 5.123 \, (E_{\mathrm{kin}})^{1/2} = 5.123 \, (E - E^0)^{1/2} \tag{4.76}$$

where $E_{\mathrm{kin}}$ is the kinetic energy of the ejected inner-shell electron and $E^0$ is the
energy loss corresponding to $E_{\mathrm{kin}} = 0$. $E^0$ is not precisely the observed threshold
energy $E_k$, since the latter corresponds to the excitation of electrons to the first
unoccupied electron level. In a metal, this would be the Fermi level and in the absence of
exchange and correlation effects (Stern et al., 1980) one might expect $E^0 = E_k - E_F$.
In insulators, the initial excitation is often to a bound state (Section 3.8.5), for which
$E_{\mathrm{kin}} < 0$, leading to $E^0 > E_k$. Because of possible chemical shifts, $E^0$ is best
obtained from the experimental spectrum; the inflection point at the edge or an
energy loss corresponding to half the total rise in intensity (Johnson et al., 1981a)
are possible choices. Unfortunately, an error in $E^0$ leads to a shift in the RDF peaks;
for the boron K-edge in BN, Stephens and Brown (1981) found that the r-values
changed by about 5% for a 5-eV change in $E^0$.

In fact, the most appropriate value of $E^0$ is related to the choice of energy zero
assumed in calculating the phase shifts that are subsequently applied to the data.
Lee and Beni (1977) proposed treating $E^0$ as a variable parameter whose value is
selected so that peaks in both the imaginary part and the absolute value (modulus) of
the Fourier transform of $\chi(k)$ occur at the same radius r. With suitably defined phase
shifts (Teo and Lee, 1979), this method of choosing $E^0$ gave r-values mostly within
1% of known interatomic spacings (up to fifth nearest neighbors) when applied to
EXAFS data from crystalline Ge and Cu (Lee and Beni, 1977).

Spectral data are usually recorded at equally spaced energy increments but after
conversion of $\chi(E)$ to $\chi(k)$, the data points are unequally spaced. If a fast Fourier
transform (FFT) algorithm is to be used, the k-increments must be equal and some
form of interpolation is needed. For finely spaced data points, linear interpolation
is adequate; in the more general case, a sinc function provides greater accuracy
(Bracewell, 1978). Some conventional (discrete) Fourier transform programs can
use unequally spaced $\chi(k)$ data.
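
The scale conversion of Eq. (4.76) and the subsequent resampling onto equal k-increments can be sketched as below. Linear interpolation is used, which the text regards as adequate for finely spaced data; the function names and the example energy zero of 532 eV are assumptions for illustration.

```python
import numpy as np

def energy_to_k(energy, e_zero):
    """Eq. (4.76): k [nm^-1] = 5.123 * sqrt(E - E^0), with energies in eV."""
    return 5.123 * np.sqrt(np.asarray(energy, dtype=float) - e_zero)

def resample_uniform_k(energy, chi, e_zero, n_points):
    """Linearly interpolate chi onto n_points equally spaced k-values.
    Only channels with E > E^0 are used."""
    keep = energy > e_zero
    k = energy_to_k(energy[keep], e_zero)
    k_uniform = np.linspace(k[0], k[-1], n_points)
    return k_uniform, np.interp(k_uniform, k, chi[keep])

# Hypothetical example: equal energy steps become unequal k steps.
energy = np.linspace(500.0, 900.0, 401)                  # eV, 1 eV per channel
chi_of_e = 0.05 * np.sin(0.1 * energy)                   # stand-in for chi(E)
k_uniform, chi_of_k = resample_uniform_k(energy, chi_of_e, e_zero=532.0, n_points=256)
```

Choosing `n_points` as a power of 2 would suit a subsequent FFT; the value of 256 here is arbitrary.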


4.6.1.4 Correction for k-Dependence of Backscattering 


According to Eq. (3.167), the RDF is modulated by the term $f_j(k)/k$, where $f_j(k)$ is
the backscattering amplitude. The $\chi(k)$ data should therefore be divided by this term.
A simple approximation is to take $f(k) \propto k^{-2}$, based on the Rutherford scattering
formula: Eq. (3.3) with q = 2k. As shown in Fig. 4.24, this provides a fair approximation
for light elements (e.g., C, O) but is inadequate for elements of higher atomic
number. In some EXAFS studies, $\chi(k)$ is multiplied by $k^n$ (as in Fig. 4.21b), the
value of n being chosen empirically to emphasize either the low-k or the high-k data,
and thus the contribution of low-Z or high-Z atoms to the backscattered intensity
(Rabe et al., 1980).


4.6.1.5 Truncation of the Data 


Before computing the Fourier transform, values of $\chi(k)$ that lie outside a chosen
range ($k = k_{\min}$ to $k_{\max}$) are removed. The low-k data are omitted because
single-scattering EXAFS theory does not apply in the near-edge region and because at low
k the phase term $\phi(k)$ becomes nonlinear in k. High-k data are excluded because they
consist mainly of noise (amplified by multiplying by $k^n$, as in Fig. 4.21b), which
could contribute spurious fine structure to the RDF. The occurrence of another
ionization edge at higher energy may also limit the maximum value of k. In typical
EXELFS studies (Johnson et al., 1981a; Leapman et al., 1981; Stephens and Brown,
1981), $k_{\min}$ lies in the range 20-40 nm$^{-1}$ and $k_{\max}$ in the range 60-120 nm$^{-1}$. If too
small a range of k is selected, the RDF peaks are broadened (leading to poor accuracy
in the determination of interatomic radii) and accompanied by satellite peaks
arising from the truncation of the data. These truncation effects can be minimized by
using a window function W(k) with smooth edges (Lee and Beni, 1977) or by choosing
$k_{\min}$ and $k_{\max}$ close to zero crossings of $\chi(k)$. When the limits have been suitably
chosen, the RDF should be insensitive to the precise values of $k_{\min}$ and $k_{\max}$.
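
The weighting and truncation steps can be combined in a short routine. The raised-cosine edge used below is only one example of a window with smooth edges and is not necessarily the form used by Lee and Beni; the function names, the taper width, and the limits are assumptions for illustration.

```python
import numpy as np

def tapered_window(k, k_min, k_max, taper):
    """Window W(k): 0 outside [k_min, k_max], 1 in the interior, with
    raised-cosine edges of width `taper` (same units as k)."""
    k = np.asarray(k, dtype=float)
    w = np.zeros_like(k)
    inside = (k >= k_min) & (k <= k_max)
    w[inside] = 1.0
    rise = inside & (k < k_min + taper)
    fall = inside & (k > k_max - taper)
    w[rise] = 0.5 * (1.0 - np.cos(np.pi * (k[rise] - k_min) / taper))
    w[fall] = 0.5 * (1.0 - np.cos(np.pi * (k_max - k[fall]) / taper))
    return w

def weight_and_truncate(k, chi, n, k_min, k_max, taper):
    """Return k**n * chi(k) * W(k), ready for Fourier transformation."""
    k = np.asarray(k, dtype=float)
    return k**n * np.asarray(chi, dtype=float) * tapered_window(k, k_min, k_max, taper)

# Hypothetical limits in nm^-1, within the typical ranges quoted in the text.
k = np.linspace(0.0, 150.0, 1501)
chi = 0.05 * np.sin(0.5 * k)                             # stand-in for chi(k)
weighted = weight_and_truncate(k, chi, n=3, k_min=30.0, k_max=100.0, taper=10.0)
```

With n = 3 the weighting corresponds to dividing by $f(k)/k$ in the Rutherford approximation $f(k) \propto k^{-2}$.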


4.6.1.6 Fourier Transformation 


The required Fourier transform will be defined as follows:

$$\tilde{\chi}(r) = \frac{1}{\pi} \int_{-\infty}^{\infty} W(k) \, \frac{k}{f_j(k)} \, \chi(k) \exp(-2ikr) \, dk \tag{4.77}$$

In practice, a discrete Fourier transform is used, so the variable k becomes $\pi m/N$
(see Section 4.1.1) and the limits of integration are m = 0 and m = N, where N is
the number of data points to be transformed. If an FFT algorithm is used, N must
be of the form $2^y$, where y is an integer, in which case the $\chi(k)$ data may require
extrapolation to values of k larger than $k_{\max}$. A large value of N gives $\tilde{\chi}$ at more
closely spaced intervals of r.

In the Fourier method of EXAFS or EXELFS analysis, interatomic distances are
deduced directly from the positions of the peaks in the transform $\tilde{\chi}(r)$. The rationale
for this procedure is as follows. Ignore for the moment the effect of the window
function and assume that the exponential and Gaussian terms in Eq. (3.167) are
unity, corresponding to the case of a perfect crystal with no atomic vibrations and
no inelastic scattering of the ejected core electron. We must also assume that the
phase shift $\phi_j(k)$ can be written in the form

$$\phi_j(k) = \phi_0 + \phi_1 k \tag{4.78}$$

Substitution of Eqs. (3.167) and (4.78) into Eq. (4.77) gives

$$\tilde{\chi}(r) = \frac{1}{\pi} \int_{-\infty}^{\infty} \sum_j \frac{N_j}{r_j^2} \sin(2kr_j + \phi_0 + \phi_1 k) \, [\cos 2kr - i \sin 2kr] \, dk \tag{4.79}$$

The imaginary part of the Fourier transform is

$$\mathrm{Im}[\tilde{\chi}(r)] = -\frac{1}{\pi} \int_{-\infty}^{\infty} \sum_j \frac{N_j}{r_j^2} \{\sin(2kr) \sin[k(2r_j + \phi_1)] \cos\phi_0 + \sin(2kr) \cos[k(2r_j + \phi_1)] \sin\phi_0\} \, dk \tag{4.80}$$

and is zero for most values of r, since the (modulated) sinusoid functions average
out to zero over a large range of k. However, if r satisfies the condition
$2r = 2r_j + \phi_1$,

$$\mathrm{Im}[\tilde{\chi}] = -\frac{1}{\pi} \frac{N_j}{r_j^2} \cos\phi_0 \int_{-\infty}^{\infty} \sin^2[(2r_j + \phi_1)k] \, dk = -\frac{N_j}{2\pi r_j^2} \cos\phi_0 \tag{4.81}$$

$\mathrm{Im}[\tilde{\chi}]$ therefore consists of a sequence of delta functions, each of weight proportional
to $N_j/r_j^2$ and located at $r = r_j + \phi_1/2$, where j = 1, 2, etc., corresponding
to successive shells of backscattering atoms. Likewise, $\mathrm{Re}[\tilde{\chi}]$ is zero except at
$r = r_j + \phi_1/2$, where it takes a value $\sin(\phi_0)/2\pi$. Consequently, the modulus
(absolute value) of $\tilde{\chi}$ can be written as


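$$|\tilde{\chi}| = \frac{1}{2\pi} \sum_j \frac{N_j}{r_j^2} (\sin^2\phi_0 + \cos^2\phi_0)^{1/2} \, \delta(r - r_j - \phi_1/2) = \frac{1}{2\pi} \sum_j \frac{N_j}{r_j^2} \, \delta(r - r_j - \phi_1/2) \tag{4.82}$$

and is proportional to the radial distribution function $N_j/r_j^2$. If Eq. (4.77) is written
in terms of exp(ikr), or $2\pi ikr$ as in Eq. (4.7), the RDF peaks occur at $r = 2r_j + \phi_1$
and at $r = r_j/\pi + \phi_1/2\pi$, respectively.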

Including the Gaussian term of Eq. (3.167) is equivalent to convolving the transform
$\tilde{\chi}$ with a function of the form $\exp[-r^2/(2\sigma_j^2)]$. In other words, the effect of
thermal and (in a noncrystalline material) static disorder is to broaden each delta
function present in $|\tilde{\chi}|$ into a Gaussian peak whose width is proportional to the
corresponding disorder parameter $\sigma_j$. To the extent that the inelastic mean free path $\lambda_i$
may be considered to be independent of k, the effect of the exponential term in Eq.
(3.167) is simply to attenuate the peaks in $\tilde{\chi}$, particularly at larger $r_j$. Insofar as the
window approximates to a rectangular function, its effect will be to convolve each
Gaussian peak with a function $\tilde{W}$ of the form (Lee and Beni, 1977)

$$\tilde{W}(r) = \frac{\sin(2k_{\max} r)}{r} - \frac{\sin(2k_{\min} r)}{r} \tag{4.83}$$

Since $k_{\max}$ is usually several times $k_{\min}$, the first of these sinc functions is
more important at small r, but both terms broaden the peaks in $|\tilde{\chi}|$ and introduce
oscillations between the peaks.

In typical EXELFS studies (Johnson et al., 1981a; Leapman et al., 1981;
Stephens and Brown, 1981; Bourdillon et al., 1984), the widths of the $\tilde{\chi}(r)$ peaks are
in the range 0.02 to 0.1 nm (see Fig. 4.22), which is considerably larger
than the thermal Debye-Waller broadening: at room temperature, $\sigma_j < 0.01$ nm in the
majority of materials (Stern et al., 1980). These peak widths, which determine the
accuracy with which the interatomic radii can be measured, are therefore a reflection
of the limited k-range. Fortunately, the sinc functions in Eq. (4.83) are symmetrical
(about r = 0) and do not shift the maxima of $|\tilde{\chi}(r)|$ and $\mathrm{Im}[\tilde{\chi}(r)]$. However, an error
in the choice of $E^0$ introduces a nonlinear term into Eq. (4.78), shifting the maxima
in $|\tilde{\chi}(r)|$ and $\mathrm{Im}[\tilde{\chi}(r)]$ by unequal amounts. This forms the basis of the scheme for
choosing $E^0$ by matching the peaks in these two functions (Lee and Beni, 1977).
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4.6.1.7 Correction for Phase Shifts 


The final step in the Fourier method is to estimate the linear (in k) component $\phi_1$
of the phase shift in order to convert the $\tilde{\chi}(r)$ peak positions into
interatomic distances. The phase function $\phi_j(k)$ actually contains two
contributions: a change $\phi_a(k)$ in phase as the ejected electron first leaves and
then returns to the emitting (and "absorbing") atom, and also a phase change $\phi_b(k)$
that occurs upon backscattering from a particular atomic shell. For K-shell EXELFS,
where (as a result of the dipole selection rule) the emitted wave has p-like character:


$$ \phi_j(k) = \phi_a^{1}(k) + \phi_b(k) - \pi \tag{4.84} $$


The superscript on $\phi_a(k)$ refers to the angular momentum quantum number $l'$ of the
final state (the emitted wave), while the final term ($-\pi$) in Eq. (4.84) accounts for
a factor $(-1)^l$ that should occur before the summation sign in Eq. (3.167), l referring
to the angular momentum of the initial state. In the case of EXELFS on an $L_{23}$ edge,
the emitted wave is expected to be mainly d-like ($l' = 2$) and the phase term takes
the form (Teo and Lee, 1979)


$$ \phi_j(k) = \phi_a^{2}(k) + \phi_b(k) \tag{4.85} $$


Note that $\phi_b(k)$ depends on j and therefore on the atomic number of the
backscattering atom, while $\phi_a(k)$ depends on the atomic number of the emitting atom
and on the angular momentum quantum number $l'$ of the emitted wave. The phase term
$\phi_j(k)$ is therefore a property of two atoms (the emitting atom being specified) and
of the type of inner shell (K, L, etc.) giving rise to the EXELFS. It has been
postulated that $\phi_j(k)$ depends on the chemical environment, but experimental work
suggests that this is not the case (Citrin et al., 1976; Lee et al., 1981), at least for
$k > 40$ nm$^{-1}$. In other words, "chemical transferability" of the phase shift can be
applied, provided the energy zero $E^0$ in Eq. (4.75) is chosen consistently.

Among others, Teo and Lee (1979) have therefore carried out ab initio calculations of
$\phi_a(k)$ and $\phi_b(k)$ using atomic wavefunctions and have tabulated these
functions for certain values of k and atomic number. Data for intervening elements
can be obtained by interpolation. Ground-state wavefunctions were assumed for
most of the elements, which could lead to a systematic error in $r_j$ if the emitting
atom is strongly ionic (Stern, 1974; Teo and Lee, 1979). Some EXAFS workers
have calculated phase shifts using wavefunctions of the higher adjacent element
in the periodic table (the Z + 1 or "optical alchemy" approximation) to allow for
relaxation of the atom following inner-shell ionization (Section 3.8.5).

The value of $\phi_1$ for use in Eq. (4.78) may be taken as the average slope of the
$\phi_j(k)$ curve in the region $k_{\min}$ to $k_{\max}$ (Leapman et al., 1981; Johnson
et al., 1981a). Since this average slope is negative (Fig. 4.23) and since
$r_j = r - \phi_1/2$ at the peak of $\tilde{\chi}(r)$, the r-value corresponding to each
$\tilde{\chi}$ peak is increased by $|\phi_1|/2$.
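
The phase-shift correction can be checked numerically. The following Python sketch is only an illustration under stated assumptions: it uses a single-shell oscillation of the form sin(2kr_j + phi_1 k) with Debye-Waller damping, a rectangular k-window, and invented parameter values (r_j = 0.25 nm, phi_1 = -0.06 nm, sigma_j = 0.007 nm). The function names are hypothetical and do not come from any EXELFS package.

```python
import cmath
import math


def chi_single_shell(k, r_j, phi_1, sigma_j):
    """One-shell oscillation sin(2*k*r_j + phi_1*k) with Debye-Waller damping."""
    return math.sin(2.0 * k * r_j + phi_1 * k) * math.exp(-2.0 * (sigma_j * k) ** 2)


def transform_peak(k_min, k_max, r_j, phi_1, sigma_j,
                   n_k=400, r_lo=0.10, r_hi=0.40, dr=0.001):
    """Return the r (nm) at which |integral chi(k) exp(2ikr) dk| is largest."""
    dk = (k_max - k_min) / n_k
    # Midpoint-rule samples of chi(k) inside the rectangular window.
    samples = []
    for i in range(n_k):
        k = k_min + (i + 0.5) * dk
        samples.append((k, chi_single_shell(k, r_j, phi_1, sigma_j)))

    def magnitude(r):
        return abs(sum(c * cmath.exp(2j * k * r) for k, c in samples)) * dk

    n_r = int(round((r_hi - r_lo) / dr)) + 1
    grid = [r_lo + i * dr for i in range(n_r)]
    return max(grid, key=magnitude)


# Assumed illustrative values: distances in nm, wavenumbers in nm^-1.
R_TRUE, PHI_1, SIGMA = 0.25, -0.06, 0.007
r_peak = transform_peak(30.0, 120.0, R_TRUE, PHI_1, SIGMA)
r_corrected = r_peak - PHI_1 / 2.0  # r_j = r - phi_1/2: shift up by |phi_1|/2
print(f"transform peak at {r_peak:.3f} nm, corrected distance {r_corrected:.3f} nm")
```

With a negative phi_1 the transform peak falls below the true shell radius, and adding |phi_1|/2 brings it back to within the resolution set by the limited k-range.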


4.6.2 Curve-Fitting Procedure 


Due to the width of the RDF peaks computed by Fourier transformation of $\chi(k)$, it
would be difficult to accurately distinguish atomic shells that are separated by less
than 0.02 nm. An alternative procedure, employed successfully in EXAFS studies,
is to use Eq. (3.167) to calculate $\chi(k)$, starting from an assumed model of the
atomic structure, and then fit this calculated function to the experimental data by
varying the parameters ($r_j$, $N_j$, $\sigma_j$) of the model.

This approach has several advantages. More exact expressions can be used for
the phase function $\phi_j(k)$, by including (for example) $k^2$ and $k^{-3}$ terms in
Eq. (4.78) (Lee et al., 1977). The k-dependence of the backscattering amplitude $f_j(k)$
can be taken into account more precisely, for example, by using a Lorentzian function
(Teo et al., 1977). This k-dependence can be different for different atomic shells and
departs significantly from the $k^{-2}$ approximation in the case of medium- and high-Z
elements; see Fig. 4.24. The k-dependence (energy dependence) of the inelastic
mean free path $\lambda_i$ (Fig. 3.57) can also be included in the analysis. With this
more accurate treatment, it has been possible in EXAFS investigations to estimate the
coordination number $N_j$ and disorder parameter $\sigma_j$ as well as interatomic
distances.
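
As a rough illustration of the curve-fitting idea, the Python sketch below fits a single-shell model to noise-free synthetic data. Everything in it is an assumption made for the example: the simplified model (a damped sine divided by k r_j^2, with a fixed linear phase), the lumping of N_j, f_j and the inelastic attenuation into one amplitude, the grid-search strategy, and the function names. A practical analysis would use a proper nonlinear least-squares routine and tabulated phase and amplitude functions.

```python
import math


def model_chi(k, r_j, sigma_j, phi_1=-0.06):
    """Unit-amplitude single-shell term (assumed form): damped sine / (k * r_j**2)."""
    damping = math.exp(-2.0 * (sigma_j * k) ** 2)
    return damping * math.sin(2.0 * k * r_j + phi_1 * k) / (k * r_j ** 2)


def fit_shell(k_values, chi_data, r_grid, sigma_grid):
    """Grid search over (r_j, sigma_j); the amplitude follows from linear least squares."""
    best = None
    for r_j in r_grid:
        for sigma_j in sigma_grid:
            basis = [model_chi(k, r_j, sigma_j) for k in k_values]
            norm = sum(b * b for b in basis)
            amp = sum(b * d for b, d in zip(basis, chi_data)) / norm
            resid = sum((d - amp * b) ** 2 for b, d in zip(basis, chi_data))
            if best is None or resid < best[0]:
                best = (resid, r_j, sigma_j, amp)
    return best[1], best[2], best[3]


# Synthetic "experimental" data generated from known (assumed) parameters.
k_values = [30.0 + i for i in range(91)]                      # nm^-1
chi_data = [6.0 * model_chi(k, 0.253, 0.008) for k in k_values]
r_grid = [0.200 + 0.001 * i for i in range(101)]              # nm
sigma_grid = [0.004 + 0.001 * i for i in range(9)]            # nm
r_fit, sigma_fit, amp_fit = fit_shell(k_values, chi_data, r_grid, sigma_grid)
print(f"r_j = {r_fit:.3f} nm, sigma_j = {sigma_fit:.3f} nm, amplitude = {amp_fit:.2f}")
```

Because the fit works directly in k-space, it can separate distances more finely than the width of the Fourier-transform peaks would suggest, which is the main point of the procedure described above.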

Particularly in the case of a completely unknown structure, the Fourier transform
method (Section 4.6.1) may be used as a basis for selecting the initial parameters
of the atomic model. These parameters are then refined by curve fitting to the
experimental data, the process being an iterative one. Sometimes use is made of a
technique known as Fourier filtering, in which the $\chi(k)$ modulation arising from a
single atomic shell is generated by back-transforming a small range of $\tilde{\chi}(r)$,
corresponding to a single peak in the RDF (Eisenberger et al., 1978); the parameters
of the model are then fitted shell by shell, starting with the shell of smallest radius.
However, in many cases of practical interest the Debye-Waller and inelastic terms
in Eq. (3.167) damp the small-k oscillations (corresponding to large $r_j$) to such an
extent that only nearest neighbor separations can be considered reliable.

A further advantage of the curve-fitting procedure is that curvature of the emit- 
ted wave (Pettifer and Cox, 1983) and multiple scattering of the ejected electron 
(Lee and Pendry, 1975) can be taken into account. Equation (3.167) represents a
plane-wave approximation and tends to fail for higher order shells (i.e., at low k),
where backscattering takes place further away from the nucleus of the backscatter- 
ing atom, which therefore “sees” a larger portion of the wave front. Equation (3.167) 
also assumes only single (elastic) scattering of the ejected electron, a condition that 
no longer applies to higher order shells. By removing these restrictions, EXAFS the- 
ory has been extended to the near-edge (XANES) region (Durham et al., 1981, 1982; 
Bianconi, 1983), forming a basis for analyzing energy-loss near-edge structure. 


4.7 Simulation of Energy-Loss Near-Edge Structure (ELNES) 


As outlined in Chapter 3, near-edge fine structure can be calculated in real space 
(multiple scattering procedure) or in reciprocal space (the band structure approach). 
Advantages of the real-space method are that it can more easily deal with
noncrystalline situations, dopants or impurities, interfaces, and particles embedded
in a matrix, for example. Advantages of the density functional approach are that the
same calculation can yield a large number of physical properties: band structure 
diagrams, elastic constants, etc., as well as ELNES and low-loss spectra. The two 
methods have been compared for the calculation of GaN, where it was concluded 
that the density functional approach was somewhat more accurate but more time 
consuming (Arslan et al., 2003; Moreno et al., 2006). Here we outline the properties 
of some readily available software packages that perform each type of calculation. 


4.7.1 Multiple Scattering Calculations 


As described in Section 3.9, Eq. (3.167) retains validity within 50 eV of an ioniza- 
tion threshold through the concept of an effective scattering amplitude that makes 
allowance for multiple scattering, curvature of the emitted wavefront, inelastic scat- 
tering, and core-hole effects. ELNES can therefore be simulated using an approach 
similar to EXELFS curve fitting (Section 4.6.2). 

These procedures are implemented in FEFF (Ankudinov et al., 1998), avail- 
able from the University of Washington (http://leonardo.phys.washington.edu/feff). 
Originally written for analysis of x-ray near-edge structure (XANES), this code has 
been adapted to ELNES. The FEFF8 version is described within that context by 
Moreno et al. (2007) and their review is largely applicable to FEFF9. The pro- 
gram contains six modules. The first calculates muffin-tin potentials of the atoms 
involved, using a self-consistent field (SCF) method. The second evaluates phase 
shifts, dipole matrix elements, and the local density of states (LDOS) in various 
angular momentum projections. A third module carries out full multiple scattering 
calculations for a specified cluster of atoms. The significant multiple scattering paths 
are then identified; effective scattering amplitudes are calculated for those paths and 
the ELNES oscillations are computed.

The input file is divided into cards that specify conditions for the calculation.
FEFF9 introduces a graphical user interface to control the different cards, although it
can also be run in the same way as FEFF8. If a card is absent, default values are used.
However, an ATOMS card is essential since it specifies the nature and coordinates
of the emitting atom and its neighbors, and is usually compiled using a separate
program through the WebAtoms interface
(http://cars9.uchicago.edu/cgi-bin/atoms/atoms.cgi). Unlike band structure methods, FEFF
does not rely on atomic periodicity or crystal symmetry, so defects such as vacancies,
impurities, and interfaces can be included in the atom cluster.

The program is typically run first for a small cluster, containing first- and
second-nearest neighbors, and the cluster size is then increased to see if the
potentials and ELNES oscillations have converged to a limit. Next, various calculated
properties are checked, such as the densities of states and the Fermi level. In the GaN
example shown in Fig. 4.25, the Fermi level was found to initially lie above the
bandgap of the semiconductor. It was shifted downward by 1.7 eV to mid-gap
position using the EXCHANGE card, which also allows a choice of the electron
self-energy (either Hedin-Lundqvist or Dirac-Hara exchange correlation potential),
whose imaginary part determines the inelastic mean free path of the ejected
electron. The COREHOLE card allows a choice between inclusion of a core hole
(recommended for nonmetallic systems) and its absence due to free-electron screening
(appropriate for metals). In general, the effect of a core hole is to shift the
edge to lower energy and sharpen the rise at the threshold, as in Fig. 4.26c.
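
To make the idea of growing a cluster shell by shell concrete, the Python sketch below groups the neighbors of a central atom into shells of equal distance. It is not part of FEFF and does not produce FEFF input; the fcc lattice is used only as a simple stand-in (GaN itself is hexagonal), and the function names are hypothetical.

```python
import math
from collections import Counter


def fcc_positions(n_cells):
    """Atom positions (in units of the lattice constant) for fcc cells around the origin."""
    basis = [(0.0, 0.0, 0.0), (0.5, 0.5, 0.0), (0.5, 0.0, 0.5), (0.0, 0.5, 0.5)]
    cells = range(-n_cells, n_cells + 1)
    return [(i + bx, j + by, m + bz)
            for i in cells for j in cells for m in cells
            for bx, by, bz in basis]


def shells_around_origin(positions, r_max, decimals=6):
    """Count neighbors of the atom at the origin, grouped by (rounded) distance."""
    counts = Counter()
    for x, y, z in positions:
        d = math.sqrt(x * x + y * y + z * z)
        if 0.0 < d <= r_max:
            counts[round(d, decimals)] += 1
    return sorted(counts.items())


cluster = fcc_positions(2)
shells = shells_around_origin(cluster, r_max=1.3)
for distance, count in shells:
    print(f"shell at {distance:.4f} a: {count} atoms")
```

Raising `r_max` adds further shells, which is the sense in which a convergence test repeats the calculation for progressively larger clusters.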

The ELNES calculations include core-hole initial-state broadening (from tables) 
and final-state broadening (from imaginary part of the calculated self-energy) 
but instrumental broadening can also be introduced, through the EXCHANGE or 
CORRECTIONS card. For hexagonal GaN, inclusion of eight atom shells reproduced
all four peaks visible above the nitrogen K-edge but 42 shells (480 atoms)
were necessary to achieve a reasonable matching with high-resolution (0.2 eV)
EELS data; see Fig. 4.26b.

Fig. 4.26 (a) Nitrogen K-edge for h-GaN calculated by FEFF using different numbers of atomic
shells. (b) 42-shell result compared with experiment (open circles). (c) K-edge calculated with and
without a core hole, compared with experiment. From Moreno et al. (2007), copyright Elsevier

Orientation dependence (anisotropy) is accommodated for XANES calculations 
through a POLARIZATION card; see Fig. 4.27. XANES calculations are done only 
for forward scattering but the ELNES card of FEFF9 makes allowance for the size of 
the collection aperture, incident beam convergence, relativistic cross sections, and 
sample/beam orientation. Allowance is made for an off-axis collection aperture and 
quadrupole transitions can be included via the MULTIPOLE card. Thermal vibra- 
tions can be added via the DEBYE card but are important only for higher specimen 
temperature (e.g., 600 K) and energies more than 50 eV beyond an edge. FEFF can 
calculate this extended fine structure (EXELFS) but uses a path expansion method 
rather than the full multiple scattering (FMS) procedure. 


4.7.2 Band Structure Calculations 


Most modern band structure methods are based on density functional theory (Kohn
and Sham, 1965). Among many DFT codes, two offer the ability to calculate ELNES
and are commercially available. CASTEP (http://www.castep.org/) is a pseudopotential
program developed at the University of Cambridge and now marketed by
Accelrys (http://accelrys.com/). Wien2k (http://www.wien2k.at/) was developed at
Vienna University of Technology from the original Wien program (Blaha et al.,
1990). It incorporates the TELNES program for calculating ELNES and the OPTIC
package, which can generate a low-loss dielectric function. A practical guide to
its use in EELS is given by Hébert (2007), and the description below is based on
that review. Seabourne et al. (2009) give further discussion of the choice of input
parameters for the Wien and CASTEP codes.

Fig. 4.27 Experimental data (open circles) compared with Wien and FEFF calculations of the
nitrogen K-edge of hexagonal GaN, for two different principal directions of momentum transfer q.
The upper spectra relate to $p_{xy}$ transitions and the lower ones to $p_z$ transitions. From
Moreno et al. (2007), copyright Elsevier. See also Moreno et al. (2006) for Wien/FEFF comparison

The first step in the Wien2k procedure is initialization, including choice of the
radius of the atomic sphere, within which initial-state wavefunctions are calculated.
Valence electrons are also taken to be atomic within this sphere and plane waves
outside. Incorrect choice leads to an unrealistic contribution of monopole terms
to the ELNES, so for low-lying (~100 eV) edges a dipole approximation may be
necessary. The electron density is calculated and refined iteratively to generate a
self-consistent field (SCF) that satisfies the Schrödinger equation.

Next the DOS and ELNES are calculated, initially with a small number of 
points within the Brillouin zone. Then the number of k-values is increased until 
the result of the SCF calculation converges to a limit. Typically at least 5000 
k-points are required; see Fig. 4.28. Besides calculating the DOS, the TELNES pro- 
gram calculates the matrix element of Eq. (3.162) and integrates over momentum 
transfer. 

To compare with experimental data, the ELNES is broadened by a Gaussian func- 
tion, whose width represents the instrumental energy resolution, and by a Lorentzian 
to allow for core hole lifetime. An energy-dependent correction is also made for 
final-state lifetime; see Section 3.8.1. 
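
The broadening step can be sketched in a few lines of Python. This is a minimal illustration, not the TELNES implementation: the kernels are truncated and renormalized, the widths (0.8 eV Gaussian, 0.5 eV Lorentzian) and the 0.1 eV energy step are arbitrary assumed values, the energy-dependent final-state correction is omitted, and the function names are hypothetical.

```python
import math


def gaussian_kernel(fwhm, step, half_width):
    """Normalized Gaussian kernel sampled every `step` out to +/- half_width."""
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    n = int(round(half_width / step))
    weights = [math.exp(-0.5 * (i * step / sigma) ** 2) for i in range(-n, n + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def lorentzian_kernel(fwhm, step, half_width):
    """Normalized, truncated Lorentzian kernel with the given full width at half maximum."""
    gamma = fwhm / 2.0
    n = int(round(half_width / step))
    weights = [1.0 / (1.0 + (i * step / gamma) ** 2) for i in range(-n, n + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def convolve_same(signal, kernel):
    """Zero-padded discrete convolution returning a list as long as `signal`.

    The kernels used here are symmetric, so the index order does not matter.
    """
    half = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = i + j - half
            if 0 <= idx < len(signal):
                acc += signal[idx] * w
        out.append(acc)
    return out


STEP_EV = 0.1
spectrum = [0.0] * 401
spectrum[200] = 1.0                                  # a single sharp line as test input
instrument = gaussian_kernel(0.8, STEP_EV, 3.0)      # instrumental resolution (assumed)
core_hole = lorentzian_kernel(0.5, STEP_EV, 10.0)    # core-hole lifetime (assumed)
gauss_only = convolve_same(spectrum, instrument)
broadened = convolve_same(gauss_only, core_hole)
```

Applying both kernels to a single sharp line leaves its area unchanged and gives a Voigt-like profile, which is the shape a calculated ELNES feature acquires before comparison with experiment.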

Correction for the effect of the core hole is done using the Z + 1 approximation or 
else by removing one core electron in the model and adding it either to the number of 
valence electrons or to the background charge, thereby preserving charge neutrality 
within the unit cell. A supercell must be used, since the core hole occurs only once; 
the number of atoms in this cell is increased until the ELNES converges. A 64-atom 
supercell is usually sufficient. A partial core hole is possible; for the L3-edge in Cu, 
a half-hole gave the best agreement with experiment. 


[Fig. 4.28 plot: intensity (arb. units) for 100, 500, 2000, 5000, 10000, and 20000 k-points]

Fig. 4.28 Wien2k calculations of the L3-edge of fcc copper, for different numbers of k-points in
the Brillouin zone. Lifetime broadening and instrumental broadening (0.7 eV) have been included.
From Hébert (2007), copyright Elsevier

Recent versions of TELNES can calculate for anisotropic materials as a function
of specimen orientation, taking into account the convergence and collection
angles. To be accurate, this calculation must be fully relativistic (Appendix A).

The OPTIC package can calculate the low-loss dielectric function, for comparison
with that derived by Kramers-Kronig analysis of experimental EELS data.
Alternatively, Wien2k can calculate the low-loss spectrum itself. This last approach
was used by Keast (2005), who calculated low-loss spectra of fourth, fifth, and some
sixth row elements of the periodic table and compared the results with experimental
data from the EELS atlas.

For accurate simulation of transition metal L-edges, multiplet effects
(Section 3.8.4) can be simulated by the CTM4XAS program (which can be accessed
by email to f.m.f.degroot@uu.nl). It is applicable to both XAS and EELS (Stavitski
and de Groot, 2010) but is intended to be used as an initial tool, prior to ab initio
multiplet calculations.


Chapter 5 
TEM Applications of EELS 


This final chapter is designed to show how the instrumentation, theory, and meth- 
ods of EELS can be combined to extract useful information from TEM specimens, 
with the possibility of high spatial resolution. As in previous chapters, we begin 
with low-loss spectroscopy and energy filtering, followed by core-loss analysis and 
elemental mapping, including factors that determine detection sensitivity and spa- 
tial resolution. Structural information obtained through the analysis of spectral fine 
structure is then discussed, and a final section shows how EELS has been applied 
to a few selected materials systems. Meanwhile, Table 5.1 lists the information 
obtainable by energy-loss spectroscopy and by alternative high-resolution methods. 


5.1 Measurement of Specimen Thickness


It is often necessary to know the local thickness of a TEM specimen, to convert the
areal density provided by EELS or EDX analysis into elemental concentration, or to
estimate defect concentration from a TEM image, for example. Several techniques
are available for in-situ thickness measurement. Analysis of a convergent-beam
diffraction pattern can achieve 5% accuracy (Castro-Fernandez et al., 1985) but is
time consuming and works only for crystalline specimens. Methods based on tilting
the specimen and observing the lateral shift of surface features (e.g., contamination
spots) are less accurate and may interfere with subsequent microscopy of the same
area. Measurement of the bremsstrahlung continuum in an x-ray emission spectrum
(Hall, 1979) can give the mass thickness of organic specimens to an accuracy
of 20% but involves substantial electron dose and possible mass loss (Leapman
et al., 1984a, b). Measurement of the elastic scattering from an amorphous specimen
yields thickness in terms of an elastic mean free path or in terms of absolute mass
thickness if the chemical composition is known (Langmore et al., 1973; Langmore
and Smith, 1992; Pozsgai, 2007).


Table 5.1 Analytical data obtainable by TEM and other methods

EELS measurement           Information obtainable            Alternative methods
Low-loss intensity         Local thickness, mass thickness   CBED, stereoscopy
Plasmon energy             Valence-electron density
Plasmon peak shift         Alloy composition                 CBED, EDXS
Low-loss fine structure    Dielectric function, JDOS         Optical spectroscopy
Low-loss fingerprinting    Phase identification              e⁻ or x-ray diffraction
Core-loss intensities      Elemental analysis                EDXS, AES
Orientation dependence     Atomic site location              X-ray ALCHEMI
Near-edge fine structure   Bonding information               XAS (XANES)
Chemical shift of edges    Oxidation state, valency          XPS, XAS
L or M white-line ratio    Valency, magnetic properties      XPS, XAS
Extended fine structure    Interatomic distances             EXAFS, diffraction
Bethe ridge (ECOSS)        Bonding information               γ-ray Compton


5.1.1 Log-Ratio Method 


The most common procedure for estimating specimen thickness within a region
defined by the incident beam (or an area-selecting aperture) is to record a low-loss
spectrum and compare the area $I_0$ under the zero-loss peak with the total area $I_t$
under the whole spectrum. From Poisson statistics (Section 3.4), the thickness t is
given by


$$ t/\lambda = \ln(I_t/I_0) \tag{5.1} $$


where $\lambda$ is the total mean free path for all inelastic scattering. As discussed in
Section 3.4.1, $\lambda$ in Eq. (5.1) should be interpreted as an effective mean free path
$\lambda(\beta)$ if a collection aperture limits the scattering angles recorded by the
spectrometer to a value $\beta$, especially if this aperture cuts off an appreciable
fraction of the scattering (e.g., $\beta < 20$ mrad).
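
Equation (5.1) translates directly into code. The short Python sketch below is an illustration with hypothetical function names; the intensities are assumed to be background-subtracted integrals in the same arbitrary units, and the mean free path used in the example is an arbitrary assumed value.

```python
import math


def relative_thickness(i_total, i_zero):
    """Return t/lambda = ln(I_t / I_0), Eq. (5.1)."""
    if i_zero <= 0.0 or i_total < i_zero:
        raise ValueError("require 0 < I_0 <= I_t")
    return math.log(i_total / i_zero)


def absolute_thickness(i_total, i_zero, mfp_nm):
    """Thickness in nm, given an inelastic mean free path in nm."""
    return mfp_nm * relative_thickness(i_total, i_zero)


# Example: half of the recorded electrons are in the zero-loss peak.
t_over_lambda = relative_thickness(2.0e6, 1.0e6)
t_nm = absolute_thickness(2.0e6, 1.0e6, mfp_nm=100.0)  # assumed 100 nm mean free path
print(f"t/lambda = {t_over_lambda:.3f}, t = {t_nm:.1f} nm")
```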

Before applying Eq. (5.1), any instrumental background should be subtracted 
from the spectrum. Particularly for very thin specimens, correct estimation of this 
background is essential for accurate thickness measurement. In the case of data 
recorded from a CCD camera, the appropriate background will be a dark-current 
spectrum acquired shortly before or after the energy-loss data, recorded with the 
same integration time and number of readouts. 

Measurement of $I_t$ and $I_0$ involves a choice of the energies $\varepsilon$, $\delta$,
and $\Delta$ that define the integration limits; see Fig. 5.1. The lower limit
($-\varepsilon$) of the zero-loss region can be taken anywhere to the left of the
zero-loss peak where the intensity has fallen practically to zero. The separation point
$\delta$ for the zero-loss and inelastic regions can be taken as the first minimum in
intensity (Fig. 5.1) on the assumption that errors arising from the overlapping tails of
the zero-loss and inelastic components approximately cancel. Alternatively, $I_0$ is
measured by fitting the zero-loss peak to an appropriate function, whose integral is
known. The upper limit $\Delta$ should correspond to an energy loss above which further
contribution to $I_t$ does not affect the required accuracy. Although
$\Delta \approx 100$ eV is sufficient for a very thin light-element specimen, a larger
value is needed for thicker or high-Z specimens, where inelastic scattering extends to
higher energy loss, due to contributions from plural scattering and inner shells,
respectively. The "compute thickness" procedure in the Gatan EELS software reduces the
need for recording a large energy range by extrapolating the spectrum to higher energy
loss.

Fig. 5.1 The integrals and energies involved in applying the log-ratio method to
measure specimen thickness
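
The choice of the separation point at the first intensity minimum can be sketched as follows. This Python example is an illustration only: it assumes a background-subtracted spectrum on a uniform energy grid, takes the zero-loss peak to be the highest channel, and uses a noise-free synthetic spectrum built from two Gaussians. Real data would normally need smoothing before the minimum is located, and the function names are hypothetical.

```python
import math


def first_minimum_index(intensity, start):
    """Index of the first local minimum after channel `start`."""
    for i in range(start + 1, len(intensity) - 1):
        if intensity[i] <= intensity[i - 1] and intensity[i] < intensity[i + 1]:
            return i
    raise ValueError("no intensity minimum found")


def log_ratio(energy, intensity):
    """Return (t/lambda, separation energy) for a uniformly sampled spectrum."""
    zlp = max(range(len(intensity)), key=intensity.__getitem__)
    split = first_minimum_index(intensity, zlp)
    i_zero = sum(intensity[: split + 1])   # zero-loss integral, up to the minimum
    i_total = sum(intensity)               # integral of the whole spectrum
    return math.log(i_total / i_zero), energy[split]


def gaussian(x, area, centre, fwhm):
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    return area / (sigma * math.sqrt(2.0 * math.pi)) * math.exp(-0.5 * ((x - centre) / sigma) ** 2)


# Synthetic spectrum: zero-loss peak (area 1000) plus one plasmon peak (area 500).
energy = [-5.0 + 0.1 * i for i in range(651)]          # eV
spectrum = [gaussian(e, 1000.0, 0.0, 1.0) + gaussian(e, 500.0, 17.0, 4.0) for e in energy]
t_over_lambda, delta = log_ratio(energy, spectrum)
print(f"separation point {delta:.1f} eV, t/lambda = {t_over_lambda:.3f}")
```

For this synthetic input the result should be close to ln(1500/1000), since the tails of the two components barely overlap.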

The analysis of Section 3.4.3 indicates that Eq. (5.1) is relatively unaffected
by elastic scattering, even if a large fraction of the latter is intercepted by an
angle-limiting aperture. This situation arises because the elastic scattering is
accompanied by a nearly equal fraction of mixed (elastic+inelastic) scattering. In
practice, Eq. (5.1) has been judged to be valid (within about 10%) for $t/\lambda$ as
large as 4 (Hosoi et al., 1981; Leapman et al., 1984a; Lee et al., 2002). In the case of
very thin specimens ($t/\lambda < 0.1$), surface excitations are significant and might
cause an overestimate of thickness (Batson, 1993a).

For $t/\lambda > 5$, an alternative procedure is available for thickness measurement,
based on the peak energy and width of the multiple scattering distribution (Perez
et al., 1977; Whitlock and Sprague, 1982); see Section 3.4.4.


5.1.1.1 Measurement of Absolute Thickness 


Equation (5.1) provides a thickness in terms of the inelastic MFP, which can be
useful for measuring the relative thicknesses of similar specimens or thickness
variations within a specimen of uniform composition. To obtain absolute thickness,
a value for $\lambda$ is required. A rough estimate (in nm) is given by
$\lambda \approx 0.8E_0$, where $E_0$ is the incident electron energy in keV. For
100-keV electrons and $\beta > 5$ mrad, this estimate is valid within a factor of 2 for
typical materials except ice; see Table 5.2.

For materials of known composition, the inelastic mean free path can be calculated.
However, atomic models such as that of Lenz (1954) yield cross sections that
may be too high, so that $\lambda$ is underestimated; the free-electron plasmon formula,
Eq. (3.58), gives mean free paths that are appropriate for some materials but are
generally an overestimate.

Table 5.2 Values of $E_m$ obtained from energy-loss measurements, together with inelastic mean
free paths for 100-keV electrons

                                                          100-keV MFP (nm)
Material  Type of specimen       Reference    $E_m$ (eV)  $\lambda$(10 mrad)  $\lambda$(100 mrad)
Al        Single-crystal foil    M&88         17.2        100                 100
Al        Polycrystalline film   C90, YE94    16.8        101                 101
Al2O3     Polycrystalline film   E92          15.9        106                 106
Ag        Polycrystalline film   EC87, C90    26.3        71                  71
Au        Polycrystalline film   EC87, C90    35.9        56                  56
Be        Single-crystal foil    M&88         12.4        129                 129
BN        Crystalline flake      E81c         17.2        99                  99
C         Arc-evaporated film    C90, E92     14.2        116                 116
C         C60 thin film          E92          14.4        115                 115
C         Diamond crystal        E92          19.1        88                  88
Cr        Polycrystalline film   E92          25.1        74                  74
Cu        Polycrystalline film   C90          30.8        63                  63
Fe        Polycrystalline film   EC87, C90    25.0        74                  57
(Fe)      306 stainless steel    M&88         23.3        78                  61
GaAs      Single crystal         E92          18.2        95                  74
Hf        Single-crystal foil    M&88         35.3        57                  41
H2O       Crystalline ice        S&93, E92    6.7         220                 200
NiO       Single crystal         M&88         19.8        89                  71
Si        Single crystal         EC87         15.0        111                 91
SiO2      Amorphous film         E92          13.8        119                 99
Zr        Single-crystal foil    M&88         24.5        75                  57

C90 = Crozier (1990); E81c = Egerton (1981c); E92 = Egerton (1992a); EC87 = Egerton and
Cheng (1987); M&88 = Malis et al. (1988); S&93 = Sun et al. (1993); YE94 = Yang and Egerton
(1995). The last two columns give mean free paths for 100-keV incident electrons:
$\lambda$(10 mrad) for $\beta$ = 10 mrad and $\lambda$(100 mrad) for $\beta \approx 100$ mrad,
obtained from $\lambda$(10 mrad) by making use of the angular distribution predicted by
Eq. (3.16)

Realistic values of mean free path are possible by using scattering theory to
parameterize $\lambda$ in terms of the collection semi-angle $\beta$, the incident energy
$E_0$, and a parameter that depends on the chemical composition of the specimen. Assuming
$\beta \ll (E/E_0)^{1/2}$, implying $\beta < 15$ mrad at $E_0$ = 100 keV, Malis et al.
(1988) parameterized the inelastic mean free path on the basis of a dipole formula:


$$ \lambda \approx \frac{106\,F\,(E_0/E_m)}{\ln(2\beta E_0/E_m)} \tag{5.2} $$


In Eq. (5.2), $\lambda$ is in nm, $\beta$ in mrad, $E_0$ in keV, and $E_m$ in eV; F is a
relativistic factor (0.768 for $E_0$ = 100 keV, 0.618 for $E_0$ = 200 keV) defined by


$$ F = \frac{T}{E_0} = \frac{m_0 v^2}{2E_0} = \frac{1 + E_0/(1022\ \mathrm{keV})}{\left(1 + E_0/(511\ \mathrm{keV})\right)^2} \tag{5.2a} $$


Equation (5.2) is based on Eq. (3.58), with an appropriate value of F but with a
nonrelativistic expression for $\theta_E$ within the logarithm term.
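
Equations (5.2) and (5.2a) are easy to evaluate numerically. The Python sketch below is an illustration with hypothetical function names (it is not the PMFP program), using the units stated above: keV for the incident energy, mrad for the collection semi-angle, eV for E_m, and nm for the result.

```python
import math


def relativistic_factor(e0_kev):
    """F of Eq. (5.2a) for an incident energy in keV."""
    return (1.0 + e0_kev / 1022.0) / (1.0 + e0_kev / 511.0) ** 2


def mfp_malis(e0_kev, beta_mrad, em_ev):
    """Inelastic mean free path (nm) from the dipole formula, Eq. (5.2)."""
    f = relativistic_factor(e0_kev)
    return 106.0 * f * (e0_kev / em_ev) / math.log(2.0 * beta_mrad * e0_kev / em_ev)


# Aluminium single-crystal entry of Table 5.2 (E_m = 17.2 eV), 100 keV, 10 mrad.
print(f"F(100 keV) = {relativistic_factor(100.0):.3f}")
print(f"F(200 keV) = {relativistic_factor(200.0):.3f}")
print(f"lambda(Al) = {mfp_malis(100.0, 10.0, 17.2):.1f} nm")
```

The formula is meant for small collection angles, so the function should not be trusted for values of beta well above the limits quoted in the text.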

By recording the low-loss spectrum from a specimen of known thickness, with
known $\beta$ and $E_0$, $\lambda$ can be determined from Eq. (5.1) and converted to
$E_m$ by iterative use of Eq. (5.2). Materials for which this has been done are listed
in Table 5.2; the appropriate values of $E_m$ can then be used in Eq. (5.2) to calculate
the mean free path appropriate to a particular collection angle, as in the PMFP program
(Appendix B).
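
One way to carry out the conversion from a measured mean free path to E_m is sketched below in Python. Bisection is used here as a simple stand-in for the iteration mentioned above; it relies on the fact that Eq. (5.2) decreases monotonically with E_m as long as E_m is below 2 beta E_0 / e. The function names and the search interval are assumptions made for this example.

```python
import math


def relativistic_factor(e0_kev):
    """F of Eq. (5.2a) for an incident energy in keV."""
    return (1.0 + e0_kev / 1022.0) / (1.0 + e0_kev / 511.0) ** 2


def mfp_malis(e0_kev, beta_mrad, em_ev):
    """Inelastic mean free path (nm) from Eq. (5.2)."""
    f = relativistic_factor(e0_kev)
    return 106.0 * f * (e0_kev / em_ev) / math.log(2.0 * beta_mrad * e0_kev / em_ev)


def em_from_mfp(mfp_nm, e0_kev, beta_mrad, tol=1e-6):
    """Solve Eq. (5.2) for E_m (eV) by bisection on [1 eV, 2*beta*E_0/e)."""
    lo = 1.0
    hi = 2.0 * beta_mrad * e0_kev / math.e
    if not (mfp_malis(e0_kev, beta_mrad, hi) < mfp_nm < mfp_malis(e0_kev, beta_mrad, lo)):
        raise ValueError("mean free path outside the range covered by the search interval")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mfp_malis(e0_kev, beta_mrad, mid) > mfp_nm:
            lo = mid      # model path too long: E_m must be larger
        else:
            hi = mid
    return 0.5 * (lo + hi)


em_al = em_from_mfp(100.0, 100.0, 10.0)  # 100 nm measured at 100 keV, 10 mrad
print(f"E_m = {em_al:.2f} eV")
```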

For an element not listed in Table 5.2 but whose atomic number Z is known,
Malis et al. (1988) proposed a formula based on measurements of 11 materials at
incident energies of 80 and 100 keV:


$$ E_m \approx 7.6\,Z^{0.36} \tag{5.3} $$


This formula is roughly consistent with the Lenz atomic model of inelastic
scattering, Eq. (3.16), but makes no allowance for differences in crystal structure or
electron density; it would predict the same mean free path for graphite, diamond,
and amorphous carbon, for example. In the case of a compound, the Lenz model
suggests an effective atomic number for use in Eq. (5.3):


$$ Z_{\mathrm{eff}} \approx \frac{\sum_i f_i Z_i^{1.3}}{\sum_i f_i Z_i^{0.3}} \tag{5.4} $$


where $f_i$ is the atomic fraction of each element of atomic number $Z_i$.
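
A short Python sketch of Eqs. (5.3) and (5.4) follows. The function names are hypothetical, and SiO2 is used purely as an example composition. The value of E_m it produces is the estimate from the atomic-number formula, which need not coincide with a measured value.

```python
def em_from_z(z):
    """Mean energy-loss parameter E_m (eV) from Eq. (5.3)."""
    return 7.6 * z ** 0.36


def effective_z(composition):
    """Effective atomic number, Eq. (5.4).

    `composition` is a list of (Z_i, f_i) pairs, f_i being atomic fractions.
    """
    numerator = sum(f * z ** 1.3 for z, f in composition)
    denominator = sum(f * z ** 0.3 for z, f in composition)
    return numerator / denominator


sio2 = [(14, 1.0 / 3.0), (8, 2.0 / 3.0)]   # one Si atom per two O atoms
z_eff = effective_z(sio2)
em_sio2 = em_from_z(z_eff)
print(f"Z_eff = {z_eff:.2f}, E_m = {em_sio2:.1f} eV")
```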

For large collection apertures ($\beta > 20$ mrad for $E_0$ = 100 keV, > 10 mrad at
200 keV), Eq. (5.2) is not applicable. The total inelastic mean free path, appropriate
to low-loss spectra recorded without an angle-limiting aperture, is given to a
reasonable approximation by substituting $\beta$ = 25 mrad (15 mrad at 200 keV) in Eq.
(5.2). Estimates of total inelastic mean free path for 100-keV electrons are given in
the last column of Table 5.2; 200-keV measurements are tabulated in Appendix C.

More recently, Iakoubovskii et al. (2008a, b) used a 200-kV STEM probe
($\alpha$ = 20 mrad, $\beta$ = 5 mrad) to measure $t/\lambda$ in specimens of 36 elements
and 34 binary oxides. Local thickness was determined mainly from the Kramers-Kronig
sum rule (Section 5.1.2). The resulting values for $\lambda$ are tabulated in
Iakoubovskii et al. (2008b). They are generally larger than given by Eq. (5.2) and were
found to be more accurately modeled as a function of specimen density $\rho$ rather than
atomic number; see Fig. 5.2a, b. As a result, Iakoubovskii et al. (2008a) proposed
the following formula:


$$ \lambda = \frac{200\,F E_0}{11\rho^{0.3}} \bigg/ \ln\left\{ \frac{\alpha^2 + \beta^2 + 2\theta_E^2 + \delta^2}{\alpha^2 + \beta^2 + 2\theta_c^2 + \delta^2} \times \frac{\theta_c^2}{\theta_E^2} \right\} \tag{5.5} $$


where the incident convergence semi-angle $\alpha$ and the collection semi-angle $\beta$
are in mrad, $\delta^2 = |\alpha^2 - \beta^2|$, $\theta_c$ = 20 mrad, and $\rho$ is the
specific gravity of the specimen (density in g/cm$^3$). The characteristic angle
$\theta_E$ was defined as


$$ \theta_E = 5.5\rho^{0.3}/(F E_0) \tag{5.5a} $$


where the relativistic factor F is given by Eq. (5.2a). Equation (5.5) incorporates
an approximation to the incident convergence correction that does not assume that
$\alpha$ and $\beta$ are below the cutoff angle $\theta_c$, taken in Eq. (5.5) to be
20 mrad. The measured $\alpha$- and $\beta$-dependence of $\lambda$ are shown for
amorphous SiO2 in Fig. 5.2c, d, together with the parameterized formula, Eq. (5.5). Note
that the value of $\lambda$ is essentially determined by the larger of $\alpha$ and
$\beta$.

Fig. 5.2 Log-log plot of inelastic mean free path (for large $\beta$) as a function of (a) effective
atomic number $Z_{\mathrm{eff}}$ and (b) density $\rho$ of the specimen. Note the smaller amount of
scatter in the latter case. Squares represent values measured by Iakoubovskii et al. (2008b) at
$E_0$ = 200 keV. Filled circles are for $E_0$ = 100 keV and are derived from Eq. (5.2) by scaling to
large collection angle by the use of Eq. (3.16). (c) Measured inelastic mean free path $\lambda$ for
SiO2 as a function of collection semi-angle $\beta$ and incident beam convergence angle $\alpha$.
The vertical arrow shows the limit to $\beta$ imposed by the lens bore of a typical TEM.
(d) Collection angle dependence of $\lambda$ for $\alpha$ = 3.2 mrad. From Iakoubovskii et al.
(2008a), copyright Wiley
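
The density-based parameterization can be evaluated with the Python sketch below. It follows Eqs. (5.5) and (5.5a) as written here, with angles in mrad, incident energy in keV, density in g/cm^3, and the result in nm. The function names are hypothetical (this is not the IMFP program), and the density of 2.33 g/cm^3 for silicon is an assumed input.

```python
import math

THETA_C_MRAD = 20.0  # cutoff angle used in Eq. (5.5)


def relativistic_factor(e0_kev):
    """F of Eq. (5.2a) for an incident energy in keV."""
    return (1.0 + e0_kev / 1022.0) / (1.0 + e0_kev / 511.0) ** 2


def theta_e_mrad(rho, e0_kev):
    """Characteristic angle of Eq. (5.5a), in mrad."""
    return 5.5 * rho ** 0.3 / (relativistic_factor(e0_kev) * e0_kev)


def mfp_iakoubovskii(e0_kev, alpha_mrad, beta_mrad, rho):
    """Inelastic mean free path (nm) from Eq. (5.5)."""
    te2 = theta_e_mrad(rho, e0_kev) ** 2
    tc2 = THETA_C_MRAD ** 2
    a2, b2 = alpha_mrad ** 2, beta_mrad ** 2
    d2 = abs(a2 - b2)
    ratio = (a2 + b2 + 2.0 * te2 + d2) / (a2 + b2 + 2.0 * tc2 + d2) * tc2 / te2
    prefactor = 200.0 * relativistic_factor(e0_kev) * e0_kev / (11.0 * rho ** 0.3)
    return prefactor / math.log(ratio)


RHO_SI = 2.33  # g/cm^3, assumed
mfp_large = mfp_iakoubovskii(200.0, 3.0, 100.0, RHO_SI)
mfp_small = mfp_iakoubovskii(200.0, 3.0, 10.0, RHO_SI)
print(f"Si, 200 keV: {mfp_large:.0f} nm at 100 mrad, {mfp_small:.0f} nm at 10 mrad")
```

Because the angular terms enter only through alpha^2 + beta^2 + |alpha^2 - beta^2|, which equals twice the larger of alpha^2 and beta^2, the result depends only on the larger of the two angles, in line with the remark above.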

Equation (5.5a) implies $\theta_E = E/(m_0 v^2)$, where E represents some mean
energy loss, whereas $\theta_E = E/(\gamma m_0 v^2)$ relativistically; see Appendix A.
This is arguably a worse choice than $\theta_E \approx E/(2E_0)$ as assumed in Eq. (5.2),
since $\gamma m_0 v^2/2 = \gamma F E_0 = 172$ keV at $E_0$ = 200 keV, whereas
$F E_0$ = 124 keV. The $E_0$ scaling of Iakoubovskii's Eq. (5.5) might therefore be
improved by replacing F in Eq. (5.5a) by $F_g = \gamma F$. Note that Eq. (5.5a) implies
$E = 11\rho^{0.3}$, whereas Eq. (3.41a) gives $E_p \propto (z\rho/A)^{1/2}$ for the
free-electron plasmon energy.

[Fig. 5.3 plot: horizontal axis, atomic number Z; legend entries for Iakoubovskii and
Malis et al. at 200 keV and 100 keV]

Fig. 5.3 Inelastic mean free paths for elemental solids at $\beta$ = 10 mrad. Equation (5.5)
is represented by filled circles at 200 keV and by open circles at 100 keV. Equation
(5.2) is represented by a solid line at 200 keV and a broken line at 100 keV
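
The energies quoted for 200 keV can be reproduced from Eq. (5.2a), taking the electron rest energy as 511 keV (a rounded value assumed here):

$$ \gamma = 1 + \frac{E_0}{511\ \mathrm{keV}} = 1 + \frac{200}{511} \approx 1.391, \qquad F = \frac{1 + 200/1022}{(1 + 200/511)^2} \approx 0.618 $$

$$ F E_0 \approx 0.618 \times 200\ \mathrm{keV} \approx 124\ \mathrm{keV}, \qquad \gamma F E_0 \approx 1.391 \times 123.5\ \mathrm{keV} \approx 172\ \mathrm{keV} $$

The relativistic denominator therefore lies between the value implied by Eq. (5.5a) (124 keV) and the value $E_0$ = 200 keV used in Eq. (5.2), and is somewhat closer to the latter.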

The main differences between Eqs. (5.2) and (5.5) are that the Iakoubovskii mean 
free paths show pronounced oscillation with atomic number and are on average a 
factor 1.4 (at 200 keV) or 1.3 (at 100 keV) larger than those given by the Malis et al. 
formula; see Fig. 5.3. 

Bonney (1990) reported that Eq. (5.2) gave thickness to within 10% when tested 
on sub-micrometer vanadium spheres whose thickness was taken to be the same 
as their diameter. The 100-keV measurements of Crozier (1990) are also within 
+15% of the Malis et al. (1988) formula. The parameterization of Iakoubovskii 
et al. (2008a) more accurately reflects the Z-dependence of Crozier’s measurements 
but overestimates the absolute values by an average of 25%; see Fig. 5.4. 

Fig. 5.5. Solid circle: 200 

Figure 5.5 shows 200-keV measurements of Jin and Li (2006). Here the 
Iakoubovskii et al. (2008a) formula matches the experimental data within 10%. The 
dipole approximation is not valid for these data, so the Malis et al. (1988) formula 
underestimates λ. Jin and Li obtained a good fit to their data by replacing Eq. (5.2a) 
by Em = 43.5 Z^0.47/A, where A represents atomic weight. 

Figure 5.6 shows 200-keV measurements on silicon, all within 10% of the 
Iakoubovskii formula (solid curve) at medium values of β. The agreement is 
improved further (dash-dot curve) by replacing F in Eq. (5.5a) by Fg = γF. 

For most materials, the Iakoubovskii et al. (2008a) parameterization of λ appears 
to represent an improvement over that of Malis et al. (1988) in terms of accuracy, 
convenience of use (requiring density rather than an Em value), and the fact 
that the formula should apply to large β (e.g., 100 mrad, limited only by post-specimen 
lenses). Advantages of large β are that its exact value need not be known 
and convergence correction is unnecessary, so the incident probe convergence α is 
not required. The programs IMFP and PMFP, described in Appendix B, calculate λ 
for the different formulas described above. 

Although Eq. (5.1) involves specimen thickness t, it is actually the total scattering 
and mass thickness that is measured by EELS. If the physical density of a 
material were reduced by a factor f, the scattering per atom would remain the same 
(according to an atomic model) and the mean free path should increase by a factor f. 
This prediction was confirmed by Jiang et al. (2010), using crystalline MgO and 
nanoporous MgO whose density was about half the bulk value, resulting in a measured 
λ about a factor of 2 larger. Such arguments ignore the presence of surface 
plasmon losses at internal pores, and as pointed out by Batson (1993a) the presence 
of surfaces increases the total scattering, despite the begrenzungs effect (Fig. 3.25). 
In practice, this increase appears to be modest. Shindo et al. (2005) measured mean 
free paths differing by a factor ~1.7 for diamond-like carbon films prepared by 
different methods, with physical densities between 1.4 and 2.1 g/cm³. 


5.1.1.2 Organic Specimens 


Biological specimens vary in porosity and are usually characterized in terms of mass 
thickness pt, which can be determined from a variant of Eq. (5.1), namely 


$$\rho t = \rho\lambda\,\ln(I_t/I_0) = (1/\sigma')\,\ln(I_t/I_0) \tag{5.6}$$


where σ′ is a cross section per unit mass. Calculations of Leapman et al. 
(1984a, b) based on Thomas-Fermi, Hartree-Fock, and dielectric models (Ashley 
and Williams, 1980) suggested that ρλ varies by no more than ±20% for biological 
compounds, although these different models predicted values of ρλ differing by 
almost a factor of 2 (e.g., 8.8 μg/cm² to 15 μg/cm² for protein at E0 = 100 keV). 
Other measurements and calculations based on the Bethe sum rule gave ρλ = 
17.2 μg/cm² for protein at 100-keV beam energy (Sun et al., 1993). 
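
As a minimal illustration of the log-ratio formula, the following Python sketch converts the total and zero-loss integrals of a spectrum into a relative thickness and a mass thickness. The counts are hypothetical, the function names are illustrative, and the ρλ value of 17.2 quoted above for protein at 100 keV is taken here to be in μg/cm².

```python
import math

def t_over_lambda(i_total, i_zero):
    """Relative thickness t/lambda = ln(It/I0) from the log-ratio formula."""
    if i_zero <= 0 or i_total < i_zero:
        raise ValueError("need 0 < I0 <= It")
    return math.log(i_total / i_zero)

def mass_thickness(i_total, i_zero, rho_lambda):
    """Mass thickness rho*t = (rho*lambda) * ln(It/I0), as in Eq. (5.6).

    rho_lambda is the product of density and inelastic mean free path;
    the result is returned in the same units.
    """
    return rho_lambda * t_over_lambda(i_total, i_zero)

# Hypothetical counts: 1.0e6 in the whole spectrum, 6.0e5 in the zero-loss peak.
rho_t = mass_thickness(1.0e6, 6.0e5, rho_lambda=17.2)
print(f"t/lambda = {t_over_lambda(1.0e6, 6.0e5):.3f}, rho*t = {rho_t:.2f} ug/cm^2")
```

Only the ratio It/I0 enters, so the absolute scale of the recorded intensity does not matter, provided both integrals come from the same spectrum.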

The only data processing involved in the log-ratio method is separation of the 
spectrum into zero-loss and inelastic components, both of which are strong signals 
and relatively noise free. Measurements can therefore be performed rapidly, with 
an electron exposure of no more than 10~!? C. Even for organic materials, where 
structural damage or mass loss can occur at a dose as low as 10~* C/cm?, thickness 
can be measured with a lateral spatial resolution below 100 nm. In this respect, the 
log-ratio technique is an attractive alternative to the x-ray continuum method (Hall, 
1979), which requires electron exposures of 10~° C or more to obtain adequate 
statistics (Leapman et al., 1984a). Rez et al. (1992) employed the log-ratio method to 
measure the thickness of paraffin crystals, with a reported accuracy of 0.4 nm under 
low-dose (0.003 C/cm²) and low-temperature (−170°C) conditions. Leapman et al. 
(1993a) used similar methods to measure 200-nm-diameter areas of protein (crotoxin) 
crystals and achieved good agreement with thicknesses determined using the 
STEM annular dark-field signal, after correcting the latter for nonlinearity arising 
from plural elastic scattering. 

Zhao et al. (1993) measured t/λ for a biological thin section by fitting its carbon 
K-edge to a sum of components derived from K-loss and plasmon-loss single- 
scattering distributions, both recorded from a pure carbon film, giving a “carbon 
equivalent” thickness. This procedure is convenient to the extent that it does not 
require recording of the low-loss region of the thin section, but it involves a radiation 
dose about 100 times higher than that required by the log-ratio method. 


5.1.2 Absolute Thickness from the K-K Sum Rule 


As described in Section 4.2, Kramers-Kronig analysis of an energy-loss spectrum 
gives a value for the absolute specimen thickness, along with energy-dependent 
dielectric data, without requiring the chemical composition of the specimen. The 
procedure involves extraction of the single-scattering distribution S(E) from a measured 
spectrum, use of the Kramers-Kronig sum rule to derive the energy-loss 
function Im[−1/ε(E)], and removal of the surface scattering component of S(E) by 
iterative computation; see Appendix B. 

If specimen thickness is the only requirement, the procedure can be simplified. 
Combining Eqs. (4.27) and (4.26) gives 


$$t = \frac{4 a_0 F E_0}{I_0\,\{1 - \mathrm{Re}[1/\varepsilon(0)]\}} \int_0^\infty \frac{S(E)\,dE}{E\,\ln(1 + \beta^2/\theta_E^2)} \tag{5.7}$$

where a0 = 0.0529 nm, F is the relativistic factor given by Eq. (5.2a), and 
θE is the characteristic angle defined by Eq. (3.28). In general, Re[1/ε(0)] = 
ε1/(ε1² + ε2²), where ε1 and ε2 are the real and imaginary parts of the optical 
permittivity. However, as discussed in Section 4.2, Re[1/ε(0)] can be taken as zero 
for a metal or semimetal and as 1/n² for an insulator or semiconductor of refractive 
index n. 

The 1/E weighting factor makes the integral in Eq. (5.7) less sensitive to higher 
orders of scattering. Provided the specimen is not too thick (t/λ < 1.2), the effect of 
plural scattering can be approximated by dividing the integral by a correction factor 
(Egerton and Cheng, 1987) 


$$C \approx 1 + 0.3\,(t/\lambda) = 1 + 0.3\,\ln(I_t/I_0) \tag{5.8}$$


Kramers-Kronig analysis carried out on thin films of Al, Cr, Cu, Ni, and Au 
(Egerton and Cheng, 1987) suggested that a more accurate value of thickness is 
obtained by subtracting an amount Δt (≈ 8 nm) from the value given by Eq. (5.7) 
to allow for surface plasmon scattering. With these approximations, and assuming 
β²/θE² >> 1 over the energy range where the inelastic intensity is significant, 
Eq. (5.7) can be simplified to 


$$t \approx \frac{2 a_0 T}{C\,I_0\,\{1 - n^{-2}\}} \int_0^\infty \frac{J(E)\,dE}{E\,\ln(\beta/\theta_E)} - \Delta t \tag{5.9}$$


Here J(E) represents the inelastic component of the energy-loss spectrum, including 
plural scattering but excluding the zero-loss peak. A computer program (TKKs) that 
evaluates Eq. (5.9) is described in Appendix B. 
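
A rough numerical sketch of Eq. (5.9) is given below. It is not the Appendix B program; it is a minimal Python illustration that assumes T = FE0 = m0v²/2, θE = E/(2γT), trapezoidal integration over a tabulated J(E), and the plural-scattering factor C of Eq. (5.8). The function name, argument names, and the convention that `n_refr=None` denotes a metal are hypothetical choices made for this example.

```python
import math

A0_NM = 0.0529       # Bohr radius in nm
M0C2_EV = 511.0e3    # electron rest energy in eV

def kk_thickness(energies, intensities, i_zero, e0_kev, beta_mrad,
                 n_refr=None, delta_t=0.0):
    """Estimate thickness (nm) from Eq. (5.9) by trapezoidal integration.

    energies    : energy losses in eV (increasing, all > 0, with theta_E < beta)
    intensities : inelastic intensity J(E) in counts per eV
    i_zero      : zero-loss integral I0 in counts
    n_refr      : refractive index; None means a metal, Re[1/eps(0)] = 0
    delta_t     : surface-plasmon correction in nm
    """
    e0 = e0_kev * 1.0e3
    gam = 1.0 + e0 / M0C2_EV
    t_kin = e0 * (1.0 + e0 / (2.0 * M0C2_EV)) / gam ** 2   # T = m0 v^2 / 2 in eV
    beta = beta_mrad * 1.0e-3

    def integrand(e, j):
        theta_e = e / (2.0 * gam * t_kin)       # characteristic angle in rad
        return j / (e * math.log(beta / theta_e))

    kk_sum = 0.0   # integral of J(E) / (E ln(beta/theta_E))
    i_inel = 0.0   # integral of J(E), needed for the factor C
    for k in range(len(energies) - 1):
        de = energies[k + 1] - energies[k]
        kk_sum += 0.5 * de * (integrand(energies[k], intensities[k])
                              + integrand(energies[k + 1], intensities[k + 1]))
        i_inel += 0.5 * de * (intensities[k] + intensities[k + 1])

    c_plural = 1.0 + 0.3 * math.log((i_zero + i_inel) / i_zero)   # Eq. (5.8)
    optical = 1.0 if n_refr is None else 1.0 - n_refr ** -2
    return 2.0 * A0_NM * t_kin * kk_sum / (c_plural * i_zero * optical) - delta_t
```

Because J(E) and I0 enter only as a ratio, the result is independent of the intensity scale; the 1/E weighting means that the lowest-energy channels supplied to the function dominate the answer.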

Because of the 1/E weighting in Eq. (5.9), the value of t is particularly sensitive 
to data at very low energy loss. The procedure used to separate the elastic scattering 
peak from the inelastic intensity is therefore important. The simplest method is to 
truncate the spectrum at the first minimum, E = δ in Fig. 5.1, but it results in 
an underestimation of t (square data points in Fig. 5.7), since contributions below 
E = δ are missing. Omitting the surface correction, by setting Δt to zero in Eq. (5.9), 
compensates for this missing contribution and gives a more realistic thickness value 
(solid circles in Fig. 5.7). Linear or parabolic extrapolation of the inelastic intensity 
(at E = δ) to zero (at E = 0) also gives acceptable results (triangles in Fig. 5.7). 
Another strategy is to model the tail of the zero-loss peak and subtract this from the 
spectrum to give the inelastic intensity J(E). 

The errors involved in these procedures can be minimized by optimizing the 
energy resolution. Spectra read out from a CCD camera should be recorded with 
a sufficiently high energy dispersion (low electron volt/channel) to minimize the 
energy range over which tails of the zero-loss peak are significant, even though this 
results in spectra with a restricted energy range. Because of the 1/E weighting in 
Eq. (5.9), the upper limit of integration can be as low as 100 eV for reasonably 
thin specimens. If t/λ > 1.2, the plural scattering correction in Eq. (5.9) becomes a 
poor approximation. A better procedure is then to use Fourier log deconvolution to 
remove the plural scattering component and employ Eq. (5.9) with C = 1. 

Fig. 5.7 Thickness (in nm) of Al, In, Sn, Sb, Te, and Ag films measured using the Kramers-Kronig 
sum rule with different treatments of the low-energy limit (Yang and Egerton, 1995). Inverted and 
upright triangles denote linear and parabolic interpolation (respectively) between the origin and 
the first intensity minimum. Eliminating data below the minimum gave values represented by the 
squares and solid circles, the surface term Δt being set to zero in the latter case. Horizontal lines 
represent the film thickness determined by weighing, assuming bulk densities 

Because the Kramers-Kronig procedure is based on the equivalence of energy-loss 
and optical data, the spectral intensity J(E) must be dominated by dipole scattering, 
implying a small collection semi-angle β (which must be known). In practice, 
values up to 18 mrad give acceptable results at E0 = 100 keV (Fig. 5.7); at 200 keV, 
this condition becomes β < 10 mrad. 

Iakoubovskii et al. (2008a) found good agreement between thicknesses obtained 
from the K-K sum rule (with E0 = 200 keV, β* ≈ 12 mrad) and those deduced 
from convergent-beam diffraction. These measurements were used to derive inelastic 
mean free paths for many elements and oxides (Iakoubovskii et al., 2008b); see 
Appendix C. 

The advantage of the K-K sum rule method is that it gives thickness without 
knowledge of the material properties of the specimen, if the latter is conducting 
so that Re[1/ε(0)] ≈ 0. In the case of semiconducting and insulating 
specimens, an optical permittivity or refractive index n is required, and in the case 
of nanoporous materials, n will be reduced because of the lower physical density. 


5.1.3 Mass Thickness from the Bethe Sum Rule 


As in Section 3.2.2, the single-scattering intensity S(E) can be written in terms of a 
differential oscillator strength df/dE rather than the energy-loss function Im(−1/ε). 
Combining Eqs. (3.33) and (4.26) gives 


$$S(E) = \frac{4\pi a_0^2 R^2 I_0 N}{F E_0 E}\,\ln\!\left(1 + \frac{\beta^2}{\theta_E^2}\right) \frac{df}{dE} \tag{5.10}$$

where N is the total number of atoms per unit area and df/dE is a dipole (small-q) 
oscillator strength. Taking all the E-dependent terms to the right-hand side of this 
equation, integrating over energy loss, and making use of the Bethe sum rule of 
Eq. (3.34), we have 


$$\rho t = A N u = \frac{u F E_0}{4\pi a_0^2 R^2 I_0}\,\frac{A}{Z} \int_0^\infty \frac{E\,S(E)}{\ln(1 + \beta^2/\theta_E^2)}\,dE \tag{5.11}$$


where u is the atomic mass unit and ρt is the mass thickness of the specimen, A and 
Z being its atomic weight and atomic number. For a compound, A is replaced by the 
molecular weight and Z by the total number of electrons per molecule. 
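
The Bethe sum-rule expression, Eq. (5.11), can be sketched numerically as follows. This Python example is illustrative only: it assumes that all energies (E, FE0, and the Rydberg energy R) are expressed in eV, that a0 is in cm so that the result is in g/cm², that θE = E/(2γFE0), and that the integral is evaluated by the trapezoidal rule over a tabulated single-scattering distribution. The names `kinematics` and `bethe_mass_thickness` are hypothetical.

```python
import math

U_G = 1.66054e-24     # atomic mass unit in g
A0_CM = 0.52918e-8    # Bohr radius in cm
RYD_EV = 13.606       # Rydberg energy R in eV
M0C2_EV = 511.0e3     # electron rest energy in eV

def kinematics(e0_kev):
    """Return (gamma, T) with T = F*E0 = m0 v^2 / 2 in eV."""
    e0 = e0_kev * 1.0e3
    gam = 1.0 + e0 / M0C2_EV
    return gam, e0 * (1.0 + e0 / (2.0 * M0C2_EV)) / gam ** 2

def bethe_mass_thickness(energies, ssd, i_zero, e0_kev, beta_mrad, a_over_z=1.9):
    """Mass thickness rho*t in g/cm^2 from Eq. (5.11).

    energies : energy losses in eV (increasing, all > 0)
    ssd      : single-scattering intensity S(E) in counts per eV
    i_zero   : zero-loss integral I0 in counts
    a_over_z : ratio A/Z (1.9 is the value suggested for biological material)
    """
    gam, t_kin = kinematics(e0_kev)
    beta = beta_mrad * 1.0e-3

    def integrand(e, s):
        theta_e = e / (2.0 * gam * t_kin)
        return e * s / math.log(1.0 + (beta / theta_e) ** 2)

    total = 0.0
    for k in range(len(energies) - 1):
        de = energies[k + 1] - energies[k]
        total += 0.5 * de * (integrand(energies[k], ssd[k])
                             + integrand(energies[k + 1], ssd[k + 1]))

    prefactor = U_G * t_kin * a_over_z / (4.0 * math.pi * A0_CM ** 2
                                          * RYD_EV ** 2 * i_zero)
    return prefactor * total
```

In contrast to the K-K sum rule, the integrand here is weighted toward high energy loss, so the tabulated S(E) has to extend far enough for the integral to converge.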

The integral in Eq. (5.11) is relatively insensitive to the instrumental energy resolution, 
allowing S(E) to be taken as the intensity J¹(E) obtained from Fourier log 
deconvolution of experimental data. The combined effect of the other terms within 
the integral is to weight S(E) by a factor typically between E and E², implying 
that the spectrum must be measured up to rather high energy loss to ensure convergence 
of the integral. Convergence is further delayed because inner atomic shells 
contribute to S(E) at energy losses above their binding energy. This means that 
Eq. (5.11) is useful only for specimens composed of light elements, with K-shell 
binding energies below 1000 eV. However, this category includes most organic and 
biological materials, containing mainly carbon, oxygen, and hydrogen. Figure 5.8 
shows the E-dependence of Eq. (5.11) for pure carbon, where the integral reaches 
its saturation value at about 1000 eV. 

The need for such an extended energy range means that the spectrum must be 
recorded in segments with different integration times and spliced together. Because 
the intensity is integrated in Eq. (5.11), good statistics and energy resolution are 
not important and the high-E data can be quite noisy. However, any detector background 
(dark current) must be carefully removed, together with any spectrometer 
background that arises from stray scattering, as discussed in Chapter 2. Unless 
the specimen is extremely thin, plural scattering should be removed by Fourier log 
deconvolution. 

For light elements, the ratio A/Z in Eq. (5.11) is close to 2, but a better approximation 
for most biological materials is to take A/Z = 1.9 (Crozier and Egerton, 
1989), in which case Eq. (5.11) can be rewritten as 


$$\rho t = \frac{B E_0\,(1 + E_0/1022)}{I_0\,(1 + E_0/511)^2} \int_0^\infty \frac{E\,J^1(E)}{\ln(1 + \beta^2/\theta_E^2)}\,dE \tag{5.12}$$


where B = 4.88 × 10⁻¹¹ g/cm², E is in eV, and E0 in keV. This equation was 
tested on thin films of copper phthalocyanine (ρt up to 30 μg/cm², corresponding 
to t/λ ≈ 1.5) and yielded mass thickness values within 10% of those determined 
by weighing (Crozier and Egerton, 1989). The Bethe sum rule has also been used to 
determine the cross section per unit mass of protein and water (Sun et al., 1993). 

Fig. 5.8 Value of the integral in Eq. (5.11), expressed as an effective number of 
contributing electrons defined by Eq. (4.33). Note that neff almost saturates as E 
approaches the K-shell binding energy (284 eV) but approaches its full value (6) 
only for energy losses around 1000 eV. From Sun et al. (1993), copyright Elsevier 

The Bethe sum rule method involves an electron exposure (typically 10⁻¹⁰ C for 
parallel recording) higher than that needed to apply the log-ratio method (~107 
C) but its potential accuracy is higher. The bremsstrahlung continuum method (Hall, 
1979) is useful for thicker specimens (t > 0.5 μm) but involves an electron exposure 
of the order of 10~° C, sufficient to cause significant mass loss in most organic 
materials if the diameter of the incident beam is less than 1 μm (Leapman et al., 
1984a). 

With suitable calibration, local mass thickness can also be obtained by integrat- 
ing the inelastic scattering over chosen ranges of scattering angle and energy loss, 
using an electron spectrometer to reject the elastic and unscattered components (Feja 
et al., 1997). This may be a lower dose alternative to measuring high-angle elastic 
scattering with a STEM and ADF detector (Wall and Hainfeld, 1986; Feja and Aebi, 
1999). 


5.2 Low-Loss Spectroscopy 


The 1-50 eV region of the energy-loss spectrum contains peaks that arise from 
inelastic scattering by outer shell electrons. In most materials, the major peak can be 
called a plasmon peak; its energy is related to valence electron density and its width 
reflects the damping effect of single-electron transitions (Section 3.3.2). In some 
cases, these transitions appear directly in the low-loss spectrum as peaks or fine 
structure oscillations superimposed on the plasmon peak. The low-loss spectrum 
is then characteristic of the material present within the electron beam and can be 
used to identify that material, if suitable comparison standards are available and the 
spectral data have a sufficiently low noise level. 


5.2.1 Identification from Low-Loss Fine Structure 


If a specimen contains regions that give rise to sharp plasmon peaks, the materials 
involved are easily identified, as demonstrated for metallic sodium, aluminum, or 
magnesium by Sparrow et al. (1983) and Jones et al. (1984). Other materials are 
harder to characterize because their plasmon peaks are broad and occur within a 
limited range, typically 15-25 eV, yet by careful comparison with low-loss spectra 
recorded from candidate materials, it is sometimes possible to identify an unknown 
phase. This fingerprinting method was used to identify 25-250 nm precipitates 
in internally oxidized Si/Ni alloy as amorphous SiO2 (Cundy and Grundy, 1966) 
and 10-100 nm precipitates in silicon as SiC (Ditchfield and Cullis, 1976). More 
recently, Evans et al. (1991) quantified the depth profile of aluminum in spinel 
implanted with 2-MeV Al⁺ ions, the depth-dependent low-loss spectrum being fitted 
to reference spectra of aluminum and spinel; see Fig. 5.9. 

Fig. 5.9 (a) Low-loss spectra with plural scattering removed: A = metallic Al, B 
= undamaged spinel, C = material at implant depth of 1.6 μm, D = best fit from 
multiple regression analysis, indicating 3.7 ± 0.6 vol.% Al. (b) Profile showing 
aluminum concentration as a function of depth. From Evans et al. (1991), with 
permission from 

Because of plural scattering, the overall shape of the low-loss spectrum depends 
on specimen thickness, so to avoid errors due to thickness differences between 
unknown and reference materials, plural scattering should be removed, for example, 
by Fourier log deconvolution. A published library allows comparison of low-loss 
spectra with those of some common materials and is now available in digital form 
(Ahn, 2004). 

Plasmon peak width can sometimes be indicative. Chen et al. (1986) showed 
that the plasmon peak measured from quasi-crystalline Al6Mn was wider (3.1 eV) 
than that of the amorphous (2.4 eV) or crystalline phases (2.2 eV), suggesting that 
the icosahedral material had a distinct electronic band structure, favoring plasmon 
decay via interband transitions. On the other hand, Levine et al. (1989) found the 
plasmon widths of icosahedral Pd59U21Si20 and Al75Cu15V10 to be the same as 
those of amorphous materials of the same composition. 

Fig. 5.10 Low-loss fine structure in the energy-loss function measured from 
evaporated thin films of three pyrimidines (cytosine, uracil, and thymine) and two 
purines (guanine and adenine). The effect of electron irradiation is also shown. 
Reprinted with permission from Isaacson (1972a). Copyright 1972, American 
Institute of Physics 

Many organic compounds provide a distinctive fine structure at energies below 
the main plasmon peak (Hainfeld and Isaacson, 1978); see Fig. 5.10. Even if the 
structure is not prominent, a careful comparison using multiple least-squares (MLS) 
fitting can allow the composition to be measured. Sun et al. (1993) used this method 
to determine the water content of cryosectioned red blood cells as 70 ± 2%, 
with water and frozen solutions of bovine serum albumin (BSA) as standards. 
Acceptable statistical errors in the MLS fitting were obtained with an electron dose 
of around 10⁻¹² C, allowing measurements on areas down to 100-nm diameter without 
bubbling or devitrification if the specimen was held at −160°C. More recently, 
Yakovlev et al. (2010) applied similar methods to study the water distribution in 
frozen hydrated skin tissue, achieving a spatial resolution of 10 nm, largely determined 
by radiation damage; the measurement dose was of the order of 1 C/cm². 
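
The least-squares fitting of a low-loss spectrum to two reference spectra can be sketched as follows. This is a generic two-component linear fit written in plain Python, not the procedure of any cited author; it assumes that the unknown and reference spectra are single-scattering distributions sampled on the same energy grid, and the function name is hypothetical. Converting the fitted coefficients into mass or volume fractions requires references normalized to a known amount of material, which is not attempted here.

```python
def mls_two_component(spectrum, ref_a, ref_b):
    """Return (a, b) minimising sum((spectrum - a*ref_a - b*ref_b)**2).

    The 2x2 normal equations are solved directly:
        [Saa Sab] [a]   [Sya]
        [Sab Sbb] [b] = [Syb]
    """
    saa = sum(x * x for x in ref_a)
    sbb = sum(x * x for x in ref_b)
    sab = sum(x * y for x, y in zip(ref_a, ref_b))
    sya = sum(s * x for s, x in zip(spectrum, ref_a))
    syb = sum(s * y for s, y in zip(spectrum, ref_b))
    det = saa * sbb - sab * sab
    if det == 0.0:
        raise ValueError("reference spectra are linearly dependent")
    a = (sya * sbb - sab * syb) / det
    b = (saa * syb - sab * sya) / det
    return a, b

# Hypothetical four-channel reference spectra and a noise-free 70:30 mixture
ref_water = [1.0, 2.0, 3.0, 4.0]
ref_protein = [4.0, 1.0, 0.0, 2.0]
mixture = [0.7 * w + 0.3 * p for w, p in zip(ref_water, ref_protein)]
a, b = mls_two_component(mixture, ref_water, ref_protein)
print(f"fitted coefficients: {a:.3f} (water), {b:.3f} (protein)")
```

With noisy data the same normal equations apply, and the fit is better conditioned the less the two reference spectra resemble each other.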

Even at 1-eV resolution, different chromophores are distinguishable on the basis 
of their distinctive low-loss spectrum (Reimer, 1961) and some of these dyes can 
be used to selectively stain biological tissue. Electron spectroscopy or energy-selected 
imaging can then provide spatial resolution superior to that obtainable with 
light-optical techniques (Jiang and Ottensmeyer, 1994). Since the energy losses 
involved are in the visible or near-UV region of the optical spectrum (below 5 eV), 
the spatial resolution will be limited by delocalization of the inelastic scattering 
(Section 5.5.3); measured values down to 1.6 nm have been reported (Barfels et al., 
1998). 

Spectral fine structure is more easily distinguished if the spectrometer system 
offers sub-electron volt energy resolution, making a monochromated TEM a pre- 
ferred option. Alternatively, a field-emission tip can provide 0.3-eV resolution at 
low emission current or 0.16 eV with Fourier sharpening (Batson et al., 1992). 


5.2.2 Measurement of Plasmon Energy and Alloy Composition 


When an alloying element is added to a metal, the lattice parameter and/or valency 
may alter, leading to a change in valence electron density and plasmon energy Ep, 
as discussed in Section 3.3.1. The shift in Ep can be determined experimentally for a 
given alloy system, using calibration samples of known composition. In some cases, 
the plasmon energy varies linearly with composition x: 


Ep(x) = Ep(0) + x(dEp/dx) (5.13) 


For Al and Mg alloys, whose sharp plasmon peaks make small energy shifts easier 
to measure, dEp/dx is in the range −4 to +6 (Williams and Edington, 1976). 
Least-squares fitting enables the mean energy of the peak to be determined with 
good accuracy (Wang et al., 1995). For aluminum and magnesium alloys, Hibbert 
and Edington (1972) achieved better than 0.1 eV accuracy using photographic 
recording and a tungsten filament electron source. 
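
Where the linear relation of Eq. (5.13) holds, a measured plasmon energy can be inverted to give the local composition. The Python sketch below illustrates this; the numerical values are hypothetical, and dEp/dx is assumed to be expressed in eV per unit atomic fraction, which the quoted range does not state explicitly.

```python
def composition_from_plasmon(ep_measured, ep_pure, dep_dx):
    """Invert Eq. (5.13): x = (Ep(x) - Ep(0)) / (dEp/dx)."""
    if dep_dx == 0:
        raise ValueError("dEp/dx must be nonzero")
    return (ep_measured - ep_pure) / dep_dx

def composition_uncertainty(ep_uncertainty, dep_dx):
    """Uncertainty in x that corresponds to an uncertainty in Ep."""
    return abs(ep_uncertainty / dep_dx)

# Hypothetical calibration: Ep(0) = 15.0 eV and dEp/dx = -4.0 eV
x = composition_from_plasmon(14.8, 15.0, -4.0)
dx = composition_uncertainty(0.1, -4.0)
print(f"x = {x:.3f} +/- {dx:.3f}")
```

The second function makes explicit why the precision of the peak position matters: with a slope of a few eV, an energy accuracy of 0.1 eV corresponds to a few atomic percent in composition.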

To maximize the signal/noise ratio, the specimen thickness should be roughly 
equal to the plasmon mean free path (Johnson and Spence, 1974), so ultrathin spec- 
imens should be avoided. But in thicker specimens, the tail of the double-scattering 
peak displaces the first-plasmon peak toward higher energy loss; deconvolution is 
necessary to remove plural scattering and ensure repeatability. Surface oxide or con- 
tamination layers also cause a shift in peak position but are minimized by careful 
specimen preparation and clean vacuum conditions. If the specimen is crystalline, 
strongly diffracting orientations should be avoided (Hibbert and Edington, 1972). 

The plasmon-shift method has been used to demonstrate solute depletion at 
grain boundaries, to estimate diffusion constants, and to examine solute redistri- 
bution in splat-cooled alloys (Williams and Edington, 1976). A spatial resolution 
better than 10 nm was achieved, partially limited by the localization of inelastic 
scattering. From processing of spectrum image data, Hunt and Williams (1991) con- 
structed plasmon-shift images to directly reflect elemental composition. Tremblay 
and L’Esperance (1994) used the same technique to measure the volume fraction 
of Al(Mn, Si, Fe) precipitates in aluminum alloys. McComb and Howie (1990) 
used low-loss analysis to study the de-alumination of zeolite catalysts, which are 
damaged by electron doses beyond about 6 C/cm². 

The plasmon-shift method has also been applied to metal—hydrogen systems, 
where hydrogen usually introduces an upward shift in the plasmon energy. For these 
systems, the free-electron formula gives plasmon energies that are generally too 
low by 1-3 eV, but predicts differences between the metal and hydride free-electron 
values that sometimes agree well with observation (Colliex et al., 1976b; Zaluzec 
et al., 1981; Zaluzec, 1992). Woo and Carpenter (1992) investigated the zirconium 
hydride system and found the plasmon energy to be higher in δ- and ε-hydrides 
than in the γ-hydride, allowing them to identify small precipitates in the Zr/Nb 
alloys used in nuclear reactor pressure tubes. The importance of these plasmon-shift 
studies lies in the fact that dispersed hydrogen cannot be detected by WDX or EDX 
spectroscopy, while quantification of lithium is difficult by core-loss EELS or EDX 
methods. Hydrogen in Ti, V, and Nb specimens may also introduce weak energy-loss 
peaks in the 4-7 eV region, presumably by creating a band of states below the 
conduction band (Stephens and Brown, 1980; Thomas, 1981). 

Oleshko et al. (2002) have shown that the plasmon energy displays a good correlation 
with mechanical properties such as elastic, bulk, and shear modulus, and 
they used this fact to determine the properties of small precipitates in Al-Cu alloys 
(Oleshko and Howe, 2007). The plasmon energy Ep depends on the effective mass 
of the participating electrons, as well as their concentration n, so a measurement of 
Ep and n (from Kramers-Kronig analysis) can provide an estimate of effective mass 
(Gass et al., 2006a). 
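
The dependence of Ep on electron concentration and effective mass can be illustrated with the free-electron (Drude) expression Ep = ħ(ne²/ε0m*)^(1/2). The Python sketch below is an assumption-laden example rather than the method of the cited work: it neglects any background dielectric constant and damping, and the function names are hypothetical.

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19  # elementary charge, C
EPS0 = 8.8541878128e-12     # vacuum permittivity, F/m
M0 = 9.1093837015e-31       # electron rest mass, kg

def plasmon_energy_ev(n_per_m3, m_eff_ratio=1.0):
    """Free-electron plasmon energy in eV for electron density n (m^-3)."""
    omega_p = math.sqrt(n_per_m3 * E_CHARGE ** 2 / (EPS0 * m_eff_ratio * M0))
    return HBAR * omega_p / E_CHARGE

def effective_mass_ratio(ep_ev, n_per_m3):
    """m*/m0 that reproduces a measured plasmon energy at density n."""
    return (plasmon_energy_ev(n_per_m3) / ep_ev) ** 2

# Aluminum, assuming 3 valence electrons per atom, 2.70 g/cm^3, A = 26.98
n_al = 3 * 2.70 * 6.02214076e23 / 26.98 * 1.0e6   # electrons per m^3
print(f"free-electron Ep for Al: {plasmon_energy_ev(n_al):.1f} eV")
```

Because Ep scales as (n/m*)^(1/2), a 1% error in the measured plasmon energy translates into roughly a 2% error in the deduced effective mass for a given n.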


5.2.3 Characterization of Small Particles 


Because small particles have a high surface/volume ratio, their excitations are 
dominated by localized surface plasmon modes, as discussed in Section 3.3.7. 
Surface-mode scattering involves energy losses below that of the volume plasmon 
peak (Section 3.3.5); the details of this scattering depend on the geometry of the 
interface and the mismatch in the energy-dependent dielectric constant. In the case 
of a small probe and a single spherical particle, the low-loss spectrum depends on 
the probe position and is different for a metallic sphere and a sphere covered with an 
oxide layer. Ugarte et al. (1992) observed an additional peak in the 3-4 eV region, 
which they attributed to a thin conducting spherical shell outside the oxide layer, 
perhaps caused by oxygen depletion by the electron beam. 

Plasmonic modes in small particles have important applications in optical signal 
processing, biosensing, and cancer therapy, and a modern (S)TEM-EELS system 
provides a good tool for studying these modes. In the case of triangular silver particles, 
Nelayah et al. (2007) showed that the lowest energy (1.75 eV) mode has maximum 
amplitude at the corners of the particles, a 2.7 eV mode at the edges, and a 
3.2 eV mode around the geometrical center, as shown in Fig. 5.11a. Simulation 
using a boundary element calculation gave very similar intensity distributions 
(Fig. 5.11b). 

Similar STEM data on silver nanorods are shown in Fig. 5.12. Plasmon reso- 
nance peaks are observed at energy losses ranging from the infrared (0.55 eV) to 
the ultraviolet (3.55 eV) region of the electromagnetic spectrum. In Fig. 5.12d, the 
EELS and ADF image information are combined to show the resonance modes with 
m = 1-5. These modes were in good agreement with those predicted using the 
CYLNDRCAP option of the discrete-dipole DDSCAT Fortran code (Draine et al., 
1994).¹ 

Analogous images can be obtained in fixed-beam EFTEM mode, as demonstrated 
for gold nanoparticles by Schaffer et al. (2008). The STEM technique is generally 
preferable in terms of energy resolution, whereas EFTEM allows a larger field of 
view in a short acquisition time (Schaffer et al., 2010). Surface plasmon modes 
within arrays of holes (made by FIB machining) have also been visualized (van 
Aken et al., 2010). 

¹Freeware available from http://www.astro.princeton.edu/~draine/DDSCAT.html 

Fig. 5.11 (a) STEM energy-filtered images of silver particles recorded at energy losses of 1.8, 2.9, 
and 3.2 eV. The white bars are 30 nm long. (b) Intensity maps computed by the boundary-element 
method for similar energies. Reproduced from Nelayah et al. (2007), copyright Nature Publishing 
Group 


Fig. 5.12 (a) Thickness map of a silver rod (465 nm long, 24 nm diameter), obtained from (b) the 
STEM-ADF image. (c) Summed spectrum from the entire nanoantenna, showing resonance modes 
(m = 1-5) and a bulk plasmon loss just below 4 eV. (d) Intensity along the rod as a function of 
energy loss. Courtesy of D. Rossouw and G. Botton, McMaster University. See also Rossouw et al. 
(2011) 

Silicon nanoparticles could provide a light emission system compatible with silicon 
processing technology. Yurtsever et al. (2006) fabricated a dispersion of Si 
particles by annealing silicon oxide films formed by chemical vapor deposition of 
silane. Cross-sectional TEM samples showed little contrast, the mean free paths 
for Si and SiO2 being very similar, but the plasmon energies differ substantially: 
17 eV in Si compared to 23 eV in SiO2. Energy-selective tomography was used 
to provide a three-dimensional visualization, by recording a series of TEM images 
at specimen tilts up to ±60° in 4° increments. Computer software allowed alignment 
and reconstruction of the 17 ± 2 eV images in three dimensions, as shown in 
Fig. 5.13a. The particles themselves had soft outlines, as expected from the delocalization 
of plasmon scattering, but they were delineated by choosing a threshold 
intensity whose contour is represented in Fig. 5.13 as a mesh image, revealing the 
irregular shape of the particles. This complex morphology is consistent with the 
broad spectral range of photoluminescence and electroluminescence observed in 
this material, while the large surface area of each particle may account for the high 
efficiency of light emission. 

Similar tomographic techniques, exploiting differences in plasmon energy, have 
been applied to generate three-dimensional images of carbon nanotubes within a 
polymer (nylon) matrix (Gass et al., 2006b). It is also possible to combine the 
plasmon signal from a light element with a simultaneously acquired HAADF sig- 
nal (from a heavy element) to show the distribution of both. This is illustrated in 
Fig. 5.13b, which is a composite tomographic image recorded from a needle-shaped 
specimen prepared by FIB milling (Li et al., 2009). 

Alexander et al. (2008) used TEM-EELS to study the properties of individual carbon 
particles (130-600 nm diameter) collected at various altitudes above the Yellow 
Sea. Kramers-Kronig analysis revealed that most particles could be categorized as 
brown rather than black (strongly absorbing) carbon, with implications for climate 
change models. K-edge intensity measurements gave a range of 1.4-1.6 for the specific 
gravity. Aerosol particles are usually characterized by light-optical techniques, 
but often within a limited spectral range and only as an average over many particles. 

Fig. 5.13 (a) Tomographic reconstruction of silicon particles in silicon oxide. White "fog" represents 
the 17-eV plasmon-loss intensity; the Si particles are rendered as mesh images at a constant 
intensity threshold. Reprinted with permission from Yurtsever et al. (2006). Copyright 2006, 
American Institute of Physics. (b) Composite image showing the three-dimensional distribution 
of silicon (green, from plasmon signal) and erbium (red, from ADF signal) in an Er-doped silicon-rich 
oxide film. Courtesy of P. Li, X. Wang, A. Meldrum, and M. Malac, National Institute of 
Nanotechnology and University of Alberta 

Fig. 5.14 Real part ε1 of the dielectric function for various film thicknesses (in 
monolayers) within a molybdenum/vanadium superlattice, compared with the results 
of an optical thick-layer calculation. From Zaluzec (1992), copyright TMS Publications 

If transmission measurements are made with the electron beam parallel to an 
interface, surface-mode contributions are maximized. Differences in scattering are 
further amplified by performing Kramers—Kronig analysis, as shown for the metal 
multilayer system in Fig. 5.14. As the layer spacing decreases, the structure departs 
from that calculated from bulk properties, possibly indicating structural transfor- 
mation to a strained layer superlattice (Zaluzec, 1992). Turowski and Kelly (1992) 
recorded low-loss spectra as a function of position across Al/SiO2/Si field-effect 
transistor structures and computed the dielectric function at each position of the 
STEM probe, as well as the electronic polarizability $\alpha_e(E)$, which may be a measure
of dielectric strength. The maximum polarizability and the energy Emax of this max- 
imum were lower near the Al and Si interfaces (Fig. 5.15), suggesting that contact 
materials reduce the dielectric strength in very thin oxides. 


Fig. 5.15 Molar polarizability $\alpha_e$ within an Al/SiO2/Si heterostructure and the energy
position Emax of the maximum in $\alpha_e$, derived from energy-loss spectroscopy. From Turowski
Fig. 5.16 Low-loss spectra recorded using the specular (400) reflection from a (100) MgO surface
heated to various temperatures in the TEM; the horizontal axis is energy loss (eV). From Wang
(1993), copyright Institute of Physics. http://iopscience.iop.org/0034-4885/56/8/002


As discussed in Section 3.3.6, energy-loss spectra can be obtained in reflection 
mode within the TEM. Figure 5.16 shows reflection low-loss spectra from a MgO 
specimen raised to successively higher temperatures. Above 500 K, the 10 eV peak 
started to disappear; reflection K-loss spectra indicated a depletion of surface oxy- 
gen atoms. Above 1400 K, the energy of the main plasmon peak and core-loss 
spectroscopy suggested formation of a surface layer of MgO, a few nanometers 
thick, which remained after subsequent cooling and exposure to air. 


5.3 Energy-Filtered Images and Diffraction Patterns 


As demonstrated in Chapter 2, the electron spectrometer in a TEM can act as a band- 
pass energy filter: inserting an energy-selecting slit at the spectrum plane results in 
an electron image or diffraction pattern from a chosen range of energy loss. This pro- 
cedure provides information in a convenient form; for example, a core-loss image 
can indicate the spatial distribution of a particular element. 

As discussed in Section 2.6, energy filtering is possible not only in a fixed-beam 
(conventional) TEM instrument, by using an imaging filter within or below the 
TEM column, but also in a scanning transmission (STEM) microscope. The various 
modes of operation are discussed in Chapter 2 and by Reimer et al. (1988, 1992). A 
detailed discussion of the contrast mechanisms in energy-filtered images is given by 
Spence (1988), Reimer and Ross-Messemer (1989, 1990), Bakenfelder et al. (1989), 
and Reimer (1995). Colliex et al. (1989) discuss energy-filtered STEM imaging of 
thick biological specimens. Energy-filtered diffraction is treated by Spence and Zuo 
(1992). 
5.3.1 Zero-Loss Images


By operating the spectrometer in a diffraction-coupled mode (the TEM set for imag- 
ing) and adjusting the spectrometer excitation or accelerating voltage so that the 
zero-loss peak passes through the energy-selecting slit, a zero-loss image is pro- 
duced with greater contrast and/or resolution than the normal (unfiltered) image; 
see Figs. 5.17 and 5.20. The main factors responsible for such improvement are 
listed below; their relative importance varies according to the type of specimen and 
whether a conventional TEM or STEM is involved. Some factors can be identified 
as affecting the resolution and others the contrast of an image, although in the case 
of small-scale repetitive features these two concepts are closely related (Nagata and 
Hama, 1971). 


5.3.1.1 Chromatic Aberration and Contrast-Reducing Effects 


In the conventional (fixed-beam) TEM, an energy-selecting slit centered on the zero-loss
peak eliminates most of the inelastically scattered electrons, greatly reducing
chromatic broadening of the image. For a very thin specimen and a low energy
loss E, the inelastic image corresponding to an energy loss E is blurred relative to
the elastic image by a Lorentzian point-spread function whose half-width is
$2r_E \approx 2\theta_E C_c (E/E_0)$, and since $\theta_E \approx 0.5E/E_0$, the image resolution
is approximately


$d_c \approx C_c (E/E_0)^2$ (5.14)


Fig. 5.17 (a) Unfiltered and (b) zero-loss micrographs of a 40-nm epitaxial gold film, recorded 
with 80-keV electrons and 10-mrad objective aperture. Energy filtering increased the crystallo- 
graphic contrast by a factor of two 
where $C_c$ is the chromatic aberration coefficient of the objective lens. For
100-keV incident electrons, $C_c$ = 2 mm, and $E \approx 37$ eV (average energy loss per
inelastic event for carbon), Eq. (5.14) gives $d_c \approx 0.3$ nm. For a large energy loss
(e.g., core loss), the chromatic point-spread function is given by Eq. (2.25); the chromatic
broadening depends on the semi-angle $\beta$ of the objective aperture in CTEM but is several
times less than the overall diameter $\beta C_c (E/E_0)$ of the chromatic disk, as indicated
in Fig. 2.17.
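
The two estimates above can be checked numerically. The following is a minimal sketch, taking Eq. (5.14) in the form $d_c \approx C_c (E/E_0)^2$ (which reproduces the quoted 0.3 nm) and the chromatic-disk diameter as $\beta C_c (E/E_0)$. The function names are illustrative, and the 10-mrad aperture semi-angle is an assumed value rather than one given in this paragraph.

```python
def thin_specimen_blur(c_c, energy_loss, e0):
    """Eq. (5.14): d_c ~ C_c * (E/E_0)**2, in the same length unit as c_c."""
    return c_c * (energy_loss / e0) ** 2


def chromatic_disk_diameter(beta, c_c, energy_loss, e0):
    """Overall diameter beta * C_c * (E/E_0) of the chromatic disk."""
    return beta * c_c * energy_loss / e0


# Values quoted in the text: 100-keV electrons, C_c = 2 mm, E = 37 eV.
C_C_NM = 2.0e6       # 2 mm expressed in nm
E_LOSS_EV = 37.0     # mean energy loss per inelastic event in carbon
E0_EV = 100.0e3      # incident energy
BETA_RAD = 10.0e-3   # assumed objective-aperture semi-angle (10 mrad)

d_thin = thin_specimen_blur(C_C_NM, E_LOSS_EV, E0_EV)
d_disk = chromatic_disk_diameter(BETA_RAD, C_C_NM, E_LOSS_EV, E0_EV)

print(f"thin-specimen blur d_c ~ {d_thin:.2f} nm")
print(f"chromatic disk diameter ~ {d_disk:.1f} nm")
```

With these inputs the thin-specimen blur comes out near 0.27 nm and the disk diameter near 7.4 nm, so the two regimes differ by more than an order of magnitude.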

As the specimen thickness increases, an increasing fraction of the electrons 
are inelastically scattered and plural scattering causes the average energy loss 
to increase, while plural elastic/inelastic scattering further broadens the inelastic 
angular distribution. The objective aperture is then “filled” with scattering and the 
chromatic broadening is closer to $d_c \approx \beta C_c E/E_0$ ($\approx$ 7 nm for $\beta$ = 10 mrad).
As a result of these various factors, chromatic aberration becomes more serious in 
thicker specimens. Zero-loss filtering then substantially improves the image contrast 
and resolution, particularly in organic specimens where inelastic scattering is strong 
relative to elastic scattering (Section 3.2.1). 

In the case of high-resolution phase-contrast imaging, inelastic scattering is 
often assumed to produce a structureless background that reduces image contrast, 
although if the low-loss spectrum contains sharp plasmon peaks, this plasmon scat- 
tering could produce image artifacts (Krivanek et al., 1990). In general, energy 
filtering permits a more quantitative comparison of image contrast with theory 
(Stobbs and Saxton, 1988), especially if allowance is made for the point-spread 
function of the image-recording CCD camera (Thust, 2009). The situation should 
be further improved with the deployment of multipole devices that correct for both 
spherical and chromatic aberration of TEM imaging lenses. 

For the examination of thick specimens, energy-filtered microscopy (EFTEM)
with 80 or 100-keV electrons is therefore an alternative to the use of higher accelerating
voltages, where chromatic aberration is reduced in proportion to $1/E_0^2$
according to Eq. (5.14). However, zero-loss filtering reduces the image intensity
by a factor of $\exp(t/\lambda)$, where $\lambda$ is the total inelastic mean free path, limiting the
maximum usable specimen thickness to about 0.5 μm at 80-keV incident energy.

Staining of biological tissue creates regions containing a high concentration of 
heavy-metal atoms surrounded by material comprised mainly of light elements (H, 
C, O), and the resulting strong variations in elastic scattering power provide usable 
contrast. Because the inelastic/elastic scattering ratio is high for light elements, elec- 
tron scattering in unstained regions is mainly inelastic and is therefore removed by 
energy filtering, leading to a further improvement in contrast. Reimer and Ross- 
Messemer (1989) reported that the contrast of large-scale features in OsO4-stained 
myelin was increased by a factor of 1.3 after zero-loss filtering. 

Because unstained biological specimens provide very low contrast, the image is 
often defocused to create phase contrast. Langmore and Smith (1992) found that 
zero-loss filtering increased the image contrast from air-dried and frozen hydrated 
TMV images by factors between 3 and 4. This improved contrast allows a reduc- 
tion in the amount of defocusing, allowing better spatial resolution and increased 
signal/noise ratio or reduced electron dose to the specimen (Schroder et al., 1990). 
In the case of crystalline specimens, defects are visible through diffraction contrast
that arises from variations in the amount of elastic scattering, depending on
the local excitation error (deviation of lattice planes from a Bragg reflecting orien- 
tation). The angular width of inelastic scattering creates a spread in excitation error, 
reducing the contrast of dislocations, planar defects, bend contours, and thickness 
contours (Metherell, 1967), so diffraction contrast is again improved by zero-loss 
filtering. Higher incident energy also provides less spread in excitation error and 
chromatic aberration; Bakenfelder et al. (1989) concluded that, for a 500-nm Al 
film, zero-loss filtering is equivalent (in terms of image quality) to raising the micro- 
scope voltage from 80 to 200 kV. Making use of zero-loss filtering and of the 
increased transmission (channeling) that occurs when a crystal is oriented close to a 
zone axis, Lehmpfuhl et al. (1989) obtained clear 80-keV images of dislocations in 
gold films as thick as 350 nm. 

At high energy loss or large scattering angle, inelastic scattering in crystals is 
believed to be partly interbranch: the character of the electron Bloch wave changes 
upon scattering and Bragg contrast is no longer preserved (Hirsch et al., 1977). 
Although zero-loss filtering removes such scattering, the overall effect appears neg- 
ligible because the probability of such scattering is low. Phonon excitation also 
causes interbranch scattering, but because the energy losses are below 0.1 eV, it 
is not removed by energy filtering. 


5.3.2 Zero-Loss Diffraction Patterns 


Energy filtering of diffraction patterns can be accomplished with an imaging fil- 
ter in a conventional TEM (Section 2.6), much more efficiently than scanning 
a diffraction pattern across the entrance aperture of a nonimaging spectrometer 
(Graczyk and Moss, 1969). Zero-loss filtering removes the diffuse background aris- 
ing from inelastic (mainly plasmon) scattering and makes faint diffraction features 
more visible (Midgley et al., 1995). Since the inelastic scattering is strongest at 
small angles (Fig. 3.7), filtering should be particularly advantageous for low-angle 
diffraction, improving the analysis of materials with a large unit cell, such as periodic
arrays of macromolecules or nonperiodic nanostructures. Filtering has also been 
used to improve the visibility of reflection diffraction patterns recorded in a TEM 
(Wang and Cowley, 1994) and to facilitate the quantitative analysis of intensities in 
convergent-beam diffraction; see Fig. 5.18. 

Removal of inelastic scattering also facilitates the quantitative analysis of amor- 
phous materials (Cockayne et al., 1991). After subtracting a smoothly varying 
atomic scattering factor, Fourier transformation of the zero-loss diffraction pattern 
leads to a radial density function (RDF) whose peak positions provide interatomic 
spacings to an accuracy of typically 5 pm. This technique allowed Liu et al. (1988) 
to demonstrate that the first- and second-neighbor distances in amorphous silicon 
alloys decrease by up to 40 pm after doping with boron and phosphorus. 

Zero-loss filtering of convergent-beam diffraction (CBED) patterns, combined 
with quantitative comparison between experimental and theoretical diffraction 
Fig. 5.18 Two-beam (220) CBED patterns of silicon: (a) no energy filtering and 1-s exposure,
(b) zero-loss filtering and 3-s exposure. From fringe spacings, the thickness was calculated to be 
270 nm (Mayer et al., 1991). From Mayer et al. (1991), copyright San Francisco Press, Inc., with 
permission 


intensities using the least-squares refinement technique, has made it possible to 
derive structure factor amplitudes and phases with accuracies sufficient to detect 
small changes in crystal electron density due to chemical bonding (Zuo, 2004). 
Temperature factors can be measured (Tsuda et al., 2002) and also small amounts 
of lattice strain. For lattice strain measurement, zero-loss filtering significantly 
improves the visibility of higher order Laue zone (HOLZ) lines in the central disk 
(direct beam) of a CBED pattern. The lattice parameters of crystalline TEM spec- 
imens can be obtained with 0.1 pm accuracy, sufficient to measure small strains 
in silicon memory devices (Kim et al., 2004). The use of a field-emission elec- 
tron source allows CBED data to be acquired from specimen areas less than 1 nm 
in diameter (Xu et al., 1991). Even in the Tanaka LACBED method, in which a 
small SAD aperture provides a degree of energy filtering, zero-loss filtering can 
substantially increase the contrast in the central region of the pattern (Burgess et al., 
1994). 


5.3.3 Low-Loss Images 


The spatial distribution of a material having a sharp plasmon peak can be displayed 
by forming an image at the corresponding energy loss. For example, Be precipitates 
in an Al alloy appear dark in a 15-eV image (plasmon loss of Al) but brighter than
their surroundings in an image recorded at 19 eV, the plasmon loss of beryllium 
(Castaing, 1975). 

Making use of the shift in plasmon energy upon alloying, Williams and Hunt 
(1992) processed spectrum image data to display the distribution of Al3Li precipi- 
tates in Al/Li alloys. Tremblay and L’Espérance (1994) used a similar technique to 
image Al(Mn, Si, Fe) particles in aluminum alloy, deducing the volume fraction of 
precipitates to be 0.81%. 

Inelastic scattering by surface plasmon excitation provides intensity at energies 
below the volume plasmon peak. At this energy loss, small particles show a bright 
outline because the probability of surface plasmon excitation is larger for an electron 
that travels at a glancing angle to the surface (Section 3.3.6). If an unidentified peak 
is seen in the low-loss spectrum of an inhomogeneous specimen, forming an image 
at that energy loss can help to determine if it arises from surface-mode scattering. 

Some organic dyes (chromophores) have absorption peaks at energies of a few 
electron volts, corresponding to visible or UV photons, and can be used as chemi- 
cally specific stains in light microscopy. By forming an image at the corresponding 
energy loss, their distribution can be mapped with high spatial resolution in an 
energy-filtering electron microscope (Jiang and Ottensmeyer, 1994). 


5.3.4 Z-Ratio Images 


A Z-ratio STEM image is formed by taking a ratio of the high-angle scattering 
(recorded by an annular detector) and the low-angle scattering, measured through an 
electron spectrometer that removes the zero-loss component; see Section 2.6.6. For 
very thin specimens, the dark-field signal represents mainly elastic scattering, while 
the spectrometer signal arises from inelastic scattering. Intensity in the ratio image 
is therefore a measure of the local elastic/inelastic scattering ratio, which is roughly 
proportional to the local (mean) atomic number Z; see Section 3.2.1. The aim is 
usually to distinguish differences in elemental composition, while suppressing the 
effects of varying specimen thickness and fluctuations in incident beam current. This 
technique was used by Crewe et al. (1975) to display images of single high-Z atoms 
on a very thin (<10 nm) carbon substrate and subsequently investigated for imaging 
small catalyst particles on a crystalline or amorphous support (Treacy et al., 1978; 
Pennycook, 1981). 

The Z-ratio technique has also been applied to thin sections of biological tissue 
(Garavito et al., 1982); see Fig. 5.19. If the section thickness is below 50 nm, so that 
plural scattering is not severe, contrast due to thickness variations (e.g., caused by a 
microtome) largely cancels in the ratio image, allowing small differences in scatter- 
ing power to be distinguished in unstained specimens (Carlemalm and Kellenberger, 
1982). 
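
The cancellation of thickness contrast can be illustrated with a toy calculation. This is a minimal sketch assuming the single-scattering limit, in which both the annular dark-field and the inelastic signal are proportional to beam current and local thickness; the 2x2 "images" and the scattering powers are invented numbers, not measured data.

```python
def z_ratio_image(adf, inelastic, floor=1e-9):
    """Pixel-wise ratio of the annular dark-field signal to the inelastic signal.

    `floor` guards against division by zero in empty pixels.
    """
    return [
        [a / max(i, floor) for a, i in zip(row_adf, row_inel)]
        for row_adf, row_inel in zip(adf, inelastic)
    ]


# Toy specimen: each row is one material, each column one thickness.
beam_current = 1000.0                       # arbitrary units
thickness = [[5.0, 10.0], [5.0, 10.0]]      # nm, varies along each row
elastic_power = [[1.0, 1.0], [3.0, 3.0]]    # higher mean Z in the second row
inelastic_power = [[2.0, 2.0], [2.0, 2.0]]  # taken as equal for simplicity

adf = [
    [beam_current * t * s for t, s in zip(row_t, row_s)]
    for row_t, row_s in zip(thickness, elastic_power)
]
inelastic = [
    [beam_current * t * s for t, s in zip(row_t, row_s)]
    for row_t, row_s in zip(thickness, inelastic_power)
]

ratio = z_ratio_image(adf, inelastic)
for row in ratio:
    print(row)
```

Within each row the ratio is the same at both thicknesses, while the two rows differ, which is the behavior described above for sections thin enough that plural scattering is not severe.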

The Z-contrast image may appear to have better spatial resolution than either 
the dark-field or inelastic image, but this effect occurs because the inelastic image 
is blurred by delocalization (Section 5.5.3); upon division of the two intensities, 
Fig. 5.19 T4 bacteriophages adsorbed to E. coli in a 30-nm section embedded in a 24% Sn resin
(1.1-μm field of view). (a) Annular dark-field image; (b) inelastic/elastic Z-ratio image in which
regions of lower atomic number appear bright. From Carlemalm et al. (1982), copyright Elsevier 


high-frequency components of the dark-field image are emphasized, equivalent to 
unsharp masking of photographic negatives (Ottensmeyer and Arsenault, 1983). If 
the contrast in the elastic or inelastic signals is too high, the nonlinear process of 
division can create image artifacts (Reichelt et al., 1984). 


5.3.5 Contrast Tuning and MPL Imaging 


Contrast tuning denotes the ability to choose an energy loss (typically in the range 
0-200 eV) where contrast is adequate but low enough for the image to be recorded 
in a single micrograph (Bauer et al., 1987; Wagner, 1990). Dynamic range is sometimes
a problem in zero-loss images of thick (e.g., 0.5 μm) sections of biological
tissue because stained regions scatter very strongly relative to unstained ones. 

Structure-sensitive contrast in biological tissue can sometimes be maximized 
by choosing an energy loss around 250 eV, just below the carbon K-edge, so that 
the contribution of carbon to the image is minimized. Structures containing ele- 
ments with lower-lying edges (sulfur, phosphorus, or heavy-metal stain) then appear 
bright in the image, giving a reversed “dark-field” contrast; see Fig. 5.20. Imaging at 
260 eV has been used to observe microdomain morphology in unstained polymers 
(Du Chesne et al., 1992). 

The most probable loss (MPL) is the energy loss at which the spectral intensity 
is highest. For thin specimens ($t/\lambda < 1$), the zero-loss peak is the most intense,
but in thick specimens this peak is reduced and the MPL corresponds to the broad
maximum of the Landau distribution (Fig. 3.30), around 80 eV for 0.5-μm Epon
and 270 eV for a 1-μm section (Reimer et al., 1992). An image obtained at this
loss will have maximum intensity, a desirable property for accurate focusing and 
short recording times, so that specimen drift and radiation damage are minimized. 
Fig. 5.20 A 30-nm section showing HIV-producing cells embedded in Epon after glutaraldehyde
and OsO4 fixation and staining with uranyl acetate. (a) Unfiltered image (the bar measures
100 nm); (b) zero-loss image showing improved contrast; (c) 110-eV image showing reduced con- 
trast; (d) 250-eV image showing structure-sensitive reversed contrast. From Ozel et al. (1990), 
copyright Elsevier 


Intensity is also increased by widening the energy-selecting slit, but at the expense 
of loss of spatial resolution due to higher chromatic aberration in EFTEM images. 
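
Locating the most probable loss in a recorded spectrum amounts to finding the channel of highest intensity. The sketch below shows this on toy spectra; the hump shape `E*exp(-E/80)` is only a stand-in for a Landau-like distribution (it is not a physical model), and the function names and count values are illustrative assumptions.

```python
import math


def most_probable_loss(energies_ev, counts):
    """Return the energy loss of the channel with the highest intensity."""
    if len(energies_ev) != len(counts) or not counts:
        raise ValueError("energies_ev and counts must be non-empty and equal in length")
    peak_index = max(range(len(counts)), key=counts.__getitem__)
    return energies_ev[peak_index]


def toy_spectrum(zero_loss_counts):
    """Toy spectrum on a 5-eV grid: one zero-loss channel plus a broad hump.

    The hump E*exp(-E/80) has its maximum at 80 eV.
    """
    energies = [5.0 * i for i in range(81)]  # 0 to 400 eV
    counts = [e * math.exp(-e / 80.0) for e in energies]
    counts[0] = zero_loss_counts
    return energies, counts


# Thin specimen: zero-loss peak dominates. Thick specimen: it is suppressed.
thin_energies, thin_counts = toy_spectrum(zero_loss_counts=1000.0)
thick_energies, thick_counts = toy_spectrum(zero_loss_counts=5.0)

mpl_thin = most_probable_loss(thin_energies, thin_counts)
mpl_thick = most_probable_loss(thick_energies, thick_counts)
print(f"thin specimen MPL:  {mpl_thin} eV")
print(f"thick specimen MPL: {mpl_thick} eV")
```

In practice the spectrum would usually be smoothed before taking the maximum, since noise in a broad distribution can shift the highest channel by several eV.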

Pearce-Percy and Cowley (1976) showed that STEM images of thick biolog- 
ical objects can be obtained with near-optimum signal/noise ratio if an electron 
spectrometer is used to accept all energy losses below or above the MPL (giving 
bright-field and dark-field energy-loss images, respectively). With MPL = 150 eV, 
they obtained 100-keV dark-field images with high contrast from 1-μm-thick chick
fibroblast nuclei. 


5.3.6 Core-Loss Images and Elemental Mapping 


The ability to display two-dimensional distributions of specific elements makes 
the TEM imaging filter a powerful tool in materials analysis. As discussed in 
Section 2.6.5, elemental mapping involves recording at least two images, before 
and after the ionization edge. The simplest procedure is to subtract the pre-edge
image from the post-edge image; see Fig. 5.21. This two-window procedure works well enough
for edges with high jump ratio (obtained from very thin specimens and high con- 
centrations of the analyzed element) but is unsatisfactory when quantitative results 
are required (Leapman and Swyt, 1983). Negative intensities can be generated in 
regions devoid of the selected element (Crozier, 1995). 

Another simple procedure involves dividing the post-edge image by the pre-edge image
(Section 2.6.5), yielding a jump-ratio image that is largely insensitive to variations 
in specimen thickness and diffracting conditions. The diffraction contrast is fur- 
ther suppressed by using rocking-beam illumination (Hofer and Warbichler, 1996), 
Fig. 5.21 A 30-nm microtomed section of a photographic emulsion, showing silver halide
microcrystals with a AgBr core and AgBrI shell. (a) Silver-M45 map obtained by subtracting 370 and
360-eV images. (b) Iodine-M45 image from subtraction of 615 and 580-eV images. From Lavergne
et al. (1994), copyright Les Editions de Physique


as illustrated in Fig. 5.22, where carbide precipitates become highly visible in a 
jump-ratio image recorded at the M23-edge. For very thin specimens, the ratio-image 
intensity would be proportional to elemental concentration, but in specimens of typ- 
ical thickness, plural scattering background components make the ratio image only 
a qualitative indication of elemental distribution (Hofer et al., 1995; Crozier, 1995). 
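
The difference between the two-window subtraction and the jump-ratio image can be seen in a toy example. This is a minimal sketch for a very thin specimen, in which every signal scales with local thickness and the background falls between the pre-edge and post-edge windows; all numbers are invented for illustration.

```python
def two_window_difference(post, pre):
    """Post-edge image minus pre-edge image, pixel by pixel."""
    return [[p - q for p, q in zip(row_p, row_q)] for row_p, row_q in zip(post, pre)]


def jump_ratio(post, pre, floor=1e-9):
    """Post-edge image divided by pre-edge image, pixel by pixel."""
    return [
        [p / max(q, floor) for p, q in zip(row_p, row_q)]
        for row_p, row_q in zip(post, pre)
    ]


# Toy 2x2 specimen: columns differ in thickness, rows in composition.
thickness = [[1.0, 2.0], [1.0, 2.0]]
concentration = [[0.0, 0.0], [1.0, 1.0]]  # first row contains none of the element

BKG_PRE, BKG_POST, EDGE = 100.0, 80.0, 60.0  # counts per unit thickness (assumed)
pre = [[BKG_PRE * t for t in row] for row in thickness]
post = [
    [t * (BKG_POST + EDGE * c) for t, c in zip(row_t, row_c)]
    for row_t, row_c in zip(thickness, concentration)
]

difference = two_window_difference(post, pre)
ratio = jump_ratio(post, pre)
print("difference:", difference)
print("jump ratio:", ratio)
```

The subtraction gives negative values in the element-free row and doubles with thickness, whereas the jump ratio is the same at both thicknesses; neither result is quantitative because the background under the edge is not modeled.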

For quantitative elemental mapping, it is desirable to record at least two pre- 
edge images. If the two energy windows are adjacent to each other, as in Fig. 1.11, 
Eqs. (4.51)-(4.53) or least-squares fitting can be used to evaluate the background 
parameters A and r, from which the background contribution to the post-edge 


Fig. 5.22 EFTEM micrographs of ferritic-martensitic 10% Cr steel containing W and Mo. (a)
Zero-loss image, showing strong diffraction contrast, (b) Fe-M23 jump-ratio image recorded with 
conical rocking-beam illumination, showing precipitates at grain boundaries. From Warbichler 
et al. (2006), copyright Elsevier 
Fig. 5.23 EFTEM images ($E_0$ = 200 keV, $\beta$ = 7.6 mrad) of an ion-thinned foil of ODS-niobium
alloy containing 0.3 at.% Ti and 0.3 at.% oxygen. Images (a) and (b) were recorded with 20-eV
windows below the Ti L3 edge, (c) with a 20-eV window just above the Ti-edge threshold. (d)
is a Ti elemental map obtained by three-window modeling, (e) a Ti-edge jump-ratio image, and (f)
an image formed from the ratio of the two pre-edge images. From Hofer et al. (1995), copyright
Elsevier


image can be calculated by the image-acquisition computer. An example is shown 
in Fig. 5.23. The two pre-edge images and the post-edge image all look similar, 
but when they are combined to form an elemental map, titanium oxide precipi- 
tates become visible and bend contour contrast within the foil largely disappears. 
Diffraction contrast is further suppressed by dividing by a low-loss or zero-loss 
image (Crozier, 1995). As further confirmation that a jump-ratio or three-window 
image represents elemental concentration, taking the ratio of the two pre-edge 
images should yield very little contrast. This is the case in Fig. 5.23f except along 
the bend contour, where intensity fluctuations may have arisen from a small change 
in specimen orientation between acquisition of the two images. 
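
A per-pixel version of the three-window calculation can be sketched as follows. It assumes a power-law background $A E^{-r}$ and, for simplicity, treats each window as a single point at its center energy; this is a simplification and not the integral form of Eqs. (4.51)-(4.53). The window energies and count values are illustrative assumptions.

```python
import math


def three_window_pixel(i_pre1, i_pre2, i_post, e_pre1, e_pre2, e_post):
    """Return (net_signal, A, r) for one pixel using a power-law background.

    The two pre-edge intensities fix A and r in I = A * E**(-r); the
    background is then extrapolated to the post-edge energy and subtracted.
    """
    if i_pre1 <= 0 or i_pre2 <= 0:
        raise ValueError("pre-edge intensities must be positive")
    r = math.log(i_pre1 / i_pre2) / math.log(e_pre2 / e_pre1)
    a = i_pre1 * e_pre1 ** r
    background = a * e_post ** (-r)
    return i_post - background, a, r


# Synthetic pixel: known background plus a known core-loss contribution.
A_TRUE, R_TRUE, EDGE_COUNTS = 1.0e12, 3.0, 500.0
E1, E2, E3 = 420.0, 440.0, 466.0  # assumed window centers in eV

pre1 = A_TRUE * E1 ** (-R_TRUE)
pre2 = A_TRUE * E2 ** (-R_TRUE)
post = A_TRUE * E3 ** (-R_TRUE) + EDGE_COUNTS

net, a_fit, r_fit = three_window_pixel(pre1, pre2, post, E1, E2, E3)
print(f"fitted r = {r_fit:.3f}, net core-loss signal = {net:.1f} counts")
```

Because r is determined from only two noisy intensities, small statistical errors in the pre-edge images are amplified on extrapolation, which is one reason the three-window map is noisier than a jump-ratio image.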

Because of background extrapolation errors (Section 4.4.4), three-window mod- 
eling produces a noisier image than the two-window methods (subtraction or 
division), as seen by comparing images (d) and (e) in Fig. 5.23. To achieve ade- 
quate statistics, the recording time can be increased or the number of image pixels 
reduced. A good strategy is to first acquire a jump-ratio image from the area 
of interest, requiring a relatively short exposure time. If the results are encour- 
aging, the three-window method can then be used to obtain a more quantitative 
elemental map. 
Fig. 5.24 Atomic resolution images of LaB6 recorded in STEM mode with 120-mrad collection
semi-angle, using a 200-kV TEM with the probe-forming and imaging lenses corrected for spherical
aberration. Lanthanum atom columns appear bright in the ADF image (a) and the La-N45 image
(b) but boron columns do not show up in the images (c and d) recorded with energy losses above
the B K-edge (188 eV). Courtesy of Sorin Lazar, FEI Company


The availability of spectrum-imaging software makes the above techniques rela- 
tively easy to implement. An extended range spectrum is recorded from each image 
pixel and can be analyzed in different ways later (Arenal et al., 2008). 

As discussed in Section 2.6.5, elastic scattering modulation of a core-loss image 
can be reduced by using a large collection angle, possible without sacrificing energy 
resolution if the post-specimen lenses are aberration corrected. Large collection 
angle maximizes the core-loss signal but gives substantially worse edge/background 
ratio, except for very thin specimens or high edge energies. Some impressive ele- 
mental maps with atomic resolution have been obtained under these conditions (e.g., 
Muller et al., 2008) but elastic scattering may exert a more subtle effect in atomic- 
scale images of crystalline specimens, as illustrated in Fig. 5.24. Here the specimen 
was aligned with the electron beam parallel to the [100] direction; La atom columns 
are visible in the STEM-ADF image and at the La-N45 (and M45) edges, whereas
images recorded with boron-K electrons show little contrast. The proposed expla- 
nation is that electrons incident on the La columns are strongly scattered and excite 
the adjacent boron atoms, producing a strong B K-signal (Lazar et al., 2010). The 
reverse effect would be much weaker because of the small elastic scattering power 
of B atoms, as shown by their absence in the ADF image. 


5.4 Elemental Analysis from Core-Loss Spectroscopy 


As discussed in Chapter 1, EELS offers higher spatial resolution and elemental 
sensitivity than EDX spectroscopy for some specimens, while generally requiring 
more skill on the part of the operator. In this section, we discuss the data collection 
strategies that have proved effective in particular cases, to complement the gen- 
eral description of spectrum-processing techniques in Chapter 4. Situations specific 
to particular elements are discussed in this section; results from particular materials 
systems are given in Section 5.7. We begin by reviewing some choices of instrument 
and method that are directly relevant to core-loss spectroscopy. 
Parallel recording of the energy-loss spectrum increases the speed and sensitivity
of elemental analysis. Variations in sensitivity within the CCD array have been
minimized by improved design and computer processing (Section 2.5.5). For STEM 
instrumentation, the use of spectrum imaging (Section 2.6.4) is attractive because 
it allows extensive data processing after acquisition, including the use of principal 
component and independent component analysis. Similar complete information can 
be recorded by an energy-filtering TEM, although the radiation dose required to 
achieve the same (time-integrated) signal is higher by a factor equal to the number 
of images collected (Section 2.6.6). For some inorganic specimens, this increased 
dose is unimportant and is outweighed by the higher current available in a broad 
beam, allowing shorter recording times and less drift of the specimen and high 
voltage. 

As discussed in Chapter 2, spectroscopy can be carried out with a conventional 
TEM operating in either its imaging or diffraction mode. In image mode, a region of 
analysis of known diameter is conveniently selected by means of the spectrometer 
entrance aperture, the collection semi-angle being known if the objective aperture 
has been calibrated. However, chromatic aberration of the TEM imaging lenses can 
prevent precise selection of the analyzed area (see Section 2.3.2) and may result 
in incorrect elemental ratios (Section 2.3.3). Chromatic effects are minimized by 
changing the microscope high voltage by an amount equal to the energy loss being 
analyzed, a technique previously used for serial recording. In TEM diffraction 
mode, the diffraction pattern should be carefully centered about the spectrometer 
entrance aperture (approximately the center of the viewing screen); the collection 
angle now depends on the camera length and the aperture diameter. Provided the 
incident probe diameter is small, chromatic aberration is unimportant in diffraction 
mode. 

A small collection angle (5-10 mrad) increases the visibility (signal/background
ratio) of an ionization edge (Section 3.5) and is appropriate for lower-energy
edges. For higher-energy edges, the problem of low intensity is reduced by choosing
a larger $\beta$. Quantitative analysis becomes problematic if strong Bragg spots (or
rings) appear just inside or outside the aperture (Egerton, 1978a). In the case
of a small probe, the convergence angle $\alpha$ may easily exceed 10 mrad, and to
avoid considerable loss of signal $\beta$ should exceed $\alpha$ (Section 4.5.3). As shown by
Fig. 4.19, quantitative core-loss analysis involves a convergence correction unless
$\alpha < \beta/2$.

The next decision is the choice of specimen region to analyze. Sometimes this is 
obvious from the TEM image (possibly aided by diffraction) but if the instrumen- 
tation allows elemental mapping, a jump-ratio image (Section 5.3.6) can be useful 
in selecting an area for more detailed analysis. Particularly for quantitative analysis 
and ionization edges below 200 eV (where plural scattering can greatly increase the 
pre-edge background), a very thin part of the specimen is needed: ideally $t/\lambda < 0.5$,
which means that the zero-loss peak contains at least 60% of the counts in the low- 
loss spectrum (Section 5.1). EELS elemental analysis is often carried out using the 
maximum available incident energy, since this is equivalent to using the thinnest 
specimen. 
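
The thickness criterion can be checked directly from a low-loss spectrum. The sketch below uses the log-ratio relation $t/\lambda = \ln(I_t/I_0)$, where $I_0$ is the zero-loss intensity and $I_t$ the total intensity; the four-channel "spectrum" and the choice of zero-loss channel are illustrative assumptions.

```python
import math


def relative_thickness(counts, zero_loss_channels):
    """Log-ratio estimate t/lambda = ln(I_total / I_zero_loss)."""
    total = sum(counts)
    zero_loss = sum(counts[i] for i in zero_loss_channels)
    if zero_loss <= 0:
        raise ValueError("zero-loss intensity must be positive")
    return math.log(total / zero_loss)


def zero_loss_fraction(t_over_lambda):
    """Fraction of low-loss counts in the zero-loss peak: exp(-t/lambda)."""
    return math.exp(-t_over_lambda)


# Toy low-loss spectrum: channel 0 holds the zero-loss peak.
counts = [650.0, 200.0, 100.0, 50.0]
t_over_lambda = relative_thickness(counts, zero_loss_channels=[0])
threshold_fraction = zero_loss_fraction(0.5)

print(f"t/lambda = {t_over_lambda:.3f}")
print(f"zero-loss fraction at t/lambda = 0.5: {threshold_fraction:.3f}")
print("thin enough for quantitative work:", t_over_lambda < 0.5)
```

The threshold fraction evaluates to about 0.61, consistent with the statement that the zero-loss peak should contain at least 60% of the low-loss counts.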
In the case of a crystalline specimen, its orientation relative to the incident beam
has an influence on quantitative microanalysis. It is advisable to avoid strongly 
diffracting conditions, such as the Bragg condition for a low-order reflection or 
where the incident beam is parallel to a low-order zone axis. Use of a less-diffracting 
situation increases the collected signal and minimizes quantification errors that can 
arise from channeling (Section 5.6.1) and from Bragg beam contributions to the 
core-loss intensity (Section 4.5.1). It may also help to improve the spatial resolution 
of analysis by reducing beam spreading (Section 5.5.2). 

Elements of atomic number greater than 12 allow a choice of the ionization 
edge used for elemental analysis. In general, only a major edge (listed in italics in 
Appendix D) is suitable, and those with a threshold energy in the range 100-2000 eV 
are preferable. Edges that are sawtooth shaped or peaked at the threshold (denoted h 
and w in Appendix D) are more easily identified and quantified, especially if the ele- 
ment occurs in low concentration. Ionization cross sections of K-edges are mostly 
known to within 10% but the situation for other edges is more variable (Egerton, 
1993). 

Quantification of the core-loss signal requires its separation from the background. 
The simplest procedure is to model the pre-edge background by fitting within a 
pre-edge region, making allowance for this background when integrating the core- 
loss signal over an energy window (typically 50-100 eV) following the edge; see 
Sections 4.4.1 and 4.5.1. This procedure becomes problematic when two or more 
edges are close in energy or when an element is present at low concentration, or 
at a low-energy edge in a specimen that is not extremely thin. In such cases, the 
core-loss signal is more successfully modeled by multiple least-squares (MLS) fit- 
ting to the background and edge components; see Section 4.5.4. Usually standard 
specimens are used to record edge shapes while calculated cross sections are used to 
derive elemental ratios. Another tactic is to investigate differences in concentration 
by subtracting spectra recorded from nearby regions of specimen (Section 4.5.4). 
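
The simplest procedure described above can be sketched as follows. This is an illustration only: it assumes a power-law background of the form A·E^(-r), fits it by linear least squares in log-log coordinates, and sums the background-subtracted counts over the integration window. The function names and the synthetic spectrum are hypothetical.

```python
import math

def fit_power_law(energies, counts):
    """Least-squares fit of ln(counts) = ln(A) - r*ln(E); returns (A, r)."""
    xs = [math.log(e) for e in energies]
    ys = [math.log(c) for c in counts]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return math.exp(mean_y - slope * mean_x), -slope

def core_loss_signal(energies, counts, fit_window, edge_onset, delta):
    """Fit the pre-edge window, extrapolate, and integrate the excess over delta."""
    lo, hi = fit_window
    pre = [(e, c) for e, c in zip(energies, counts) if lo <= e <= hi]
    amplitude, r = fit_power_law([e for e, _ in pre], [c for _, c in pre])
    return sum(c - amplitude * e ** (-r)
               for e, c in zip(energies, counts)
               if edge_onset <= e < edge_onset + delta)

# Synthetic test spectrum: E^-3 background plus a 50-count step edge at 285 eV.
energies = [200.0 + i for i in range(200)]
counts = [1.0e9 * e ** -3.0 + (50.0 if e >= 285.0 else 0.0) for e in energies]
signal = core_loss_signal(energies, counts, (230.0, 280.0), 285.0, 50.0)
```

With noisy data, the extrapolation error grows with the width of the integration window, which is one reason the MLS approach becomes preferable for weak or overlapping edges.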

To determine elemental ratios, a choice must be made between a standardless 
procedure (using calculated or parameterized cross sections; see Appendix B) and 
a standards-based (k-factor) method. The standardless approach is convenient, but 
requires that the collection semi-angle be known, at least approximately. The k- 
factor method uses one or more standard specimens of known composition that 
incorporate each analyzed element. The incident beam energy and collection angle 
are not needed, provided the same values are used for the unknown and standard 
specimens. Some sources of systematic error, such as poorly known cross sec- 
tions and chromatic aberration effects, cancel when using the k-factor method. 
Standards that have been found useful include apatite (Ca5P3O12F) and rhodizite
(K0.46Cs0.36Rb0.06Na0.02Al3.99Be4.55B11.35Li0.02O28).
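
The two routes to an elemental ratio can be compared in a short sketch. It assumes that both edges are integrated over the same energy window and collection semi-angle, so that the standardless ratio is the intensity ratio multiplied by the inverse ratio of partial cross sections. All function names and numerical values are hypothetical.

```python
def atomic_ratio_standardless(i_a, i_b, sigma_a, sigma_b):
    """N_A/N_B from core-loss integrals and partial cross sections
    evaluated for the same collection angle and energy window."""
    return (i_a / i_b) * (sigma_b / sigma_a)

def k_factor(i_a_std, i_b_std, ratio_std):
    """k_AB from a standard of known atomic ratio N_A/N_B."""
    return ratio_std / (i_a_std / i_b_std)

def atomic_ratio_k_factor(i_a, i_b, k_ab):
    """N_A/N_B for the unknown, recorded under the same conditions as the standard."""
    return k_ab * (i_a / i_b)

# Hypothetical numbers: the cross sections are in arbitrary but identical units.
ratio_calc = atomic_ratio_standardless(2000.0, 1000.0, 4.0e-25, 1.0e-25)
k_ab = k_factor(3000.0, 1000.0, 1.5)
ratio_std_based = atomic_ratio_k_factor(2000.0, 1000.0, k_ab)
```

The k-factor absorbs the cross-section ratio, which is why the beam energy and collection angle need not be known as long as they are unchanged between standard and unknown.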

Although core-loss spectroscopy can in principle identify any element in the peri- 
odic table, some are more easily detected than others. EELS is most commonly 
used for analyzing elements of low atomic number, which are difficult to quan- 
tify by EDX spectroscopy. In the following section, we show how EELS has been 
employed to detect or quantify specific light elements. 


5.4.1 Measurement of Hydrogen and Helium 


Hydrogen in its elemental form is observable from the presence of an ionization 
edge. Although the ionization energy is 13.6 eV, this value corresponds to transitions 
to continuum states of an isolated atom. At slightly lower energy loss, a Lyman 
series of transitions to discrete levels gives peaks that may not be resolved in
TEM spectrometer systems, the result being a structureless edge with a maximum at
about 12 eV, followed by a gradual decay on the high-loss side (Ahn and Krivanek, 
1983). Energy-loss spectroscopy can be used to measure the composition of gases 
(including H2) inside a TEM environmental cell, to an accuracy of 15% (Crozier and 
Chenna, 2011). EELS has also detected molecular hydrogen present as bubbles in 
ion-implanted SiC (Hojou et al., 1992) and in frozen hydrated biological specimens 
after irradiation within the TEM (Leapman and Sun, 1995). Bubbles do not form if 
the specimen is sufficiently thin, suggesting that the hydrogen can diffuse out even 
at -170°C (Yakovlev et al., 2009). 

Hydrogen chemically combined with other elements transfers its electrons to the 
whole solid, destroying the energy levels that would give rise to a characteristic 
ionization edge. Nevertheless, metallic hydrides have been detected from their low- 
loss spectra; electrons donated by H atoms usually increase the valence-electron 
density, shifting the plasmon peak upward by 1 or 2 eV from that of the metal;
see Section 5.2.2. In minerals, an oxygen K-edge prepeak near 530 eV, previously 
thought to be indicative of hydrogen, has more recently been interpreted as due to 
liberation of O2 during electron irradiation (Garvie, 2010). 
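
The size of the plasmon shift can be estimated with a minimal sketch that assumes the free-electron (Drude) plasmon formula, E_p = ħ(ne²/ε0m0)^(1/2). The electron density used below is a nominal value for aluminum, chosen only as an illustration and not taken from the hydride studies cited above.

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19  # elementary charge, C
EPS0 = 8.8541878128e-12     # vacuum permittivity, F/m
M0 = 9.1093837015e-31       # electron rest mass, kg

def plasmon_energy_ev(n_per_m3):
    """Free-electron plasmon energy (eV) for valence-electron density n (m^-3)."""
    omega_p = math.sqrt(n_per_m3 * E_CHARGE ** 2 / (EPS0 * M0))
    return HBAR * omega_p / E_CHARGE

n_metal = 1.81e29                       # assumed density, roughly that of Al
e_metal = plasmon_energy_ev(n_metal)    # about 15.8 eV
# A 13% increase in electron density raises the plasmon energy by about 1 eV.
shift = plasmon_energy_ev(1.13 * n_metal) - e_metal
```

Because E_p scales as the square root of n, a shift of 1 or 2 eV corresponds to a fractional change in electron density roughly twice the fractional change in plasmon energy.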

Hydrogen present in an organic compound influences its low-loss spectrum. 
Hydrocarbon polymers have their main “plasmon” peak at a lower energy than 
that of amorphous carbon (~24 eV) because hydrogen reduces the mass density. 
If hydrogen is lost, for example, during electron irradiation, the plasmon energy 
increases toward that of amorphous carbon (Ditchfield et al., 1973). 

Hydrogen in an organic material also increases the inelastic/elastic scattering
ratio n, measurable in a conventional TEM from the total intensity $I$ and zero-loss
intensity $I_0$ in a spectrum recorded without an angle-limiting aperture, together with
the zero-loss intensity $I_a$ recorded with a small (~2 mrad) angle-limiting aperture.
Making allowance for plural scattering,


$$n = \frac{\ln(I/I_0)}{\ln(I_0/I_a)} \qquad (5.15)$$


The specimen must be thick enough to avoid $I_0$ and $I_a$ being almost equal, otherwise
fluctuations in incident beam current result in poor accuracy. This type of measure- 
ment was used to monitor the loss of hydrogen from 9,10-diphenyl anthracene as a 
function of electron dose (Egerton, 1976a). 
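
A minimal sketch of this measurement follows. It assumes Poisson statistics for both scattering processes, so that ln(I/I0) gives the inelastic scattering parameter and ln(I0/Ia) the elastic one; the symbols I, I0, and Ia denote the total intensity, the zero-loss intensity without an aperture, and the zero-loss intensity with a small aperture. The function name and test intensities are hypothetical.

```python
import math

def inelastic_elastic_ratio(total, zero_loss_open, zero_loss_apertured):
    """Ratio n = ln(I/I0) / ln(I0/Ia) of inelastic to elastic scattering."""
    inelastic = math.log(total / zero_loss_open)
    elastic = math.log(zero_loss_open / zero_loss_apertured)
    if elastic <= 0.0:
        raise ValueError("I0 and Ia are too close: specimen too thin for this method")
    return inelastic / elastic

# Synthetic intensities built from t/lambda_i = 0.6 and t/lambda_e = 0.2.
total = 1000.0
i0 = total * math.exp(-0.6)
ia = i0 * math.exp(-0.2)
n = inelastic_elastic_ratio(total, i0, ia)
```

The explicit check on the denominator reflects the accuracy problem noted in the text: when I0 and Ia are nearly equal, small fluctuations dominate the result.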

Helium is produced in the form of nanometer-sized bubbles when stainless steel 
(used as fuel cladding in nuclear reactors) is irradiated with neutrons. By position- 
ing a STEM probe at a bubble and on the nearby metal matrix, McGibbon and 
Brown (1990) recorded energy-loss spectra that revealed the helium K-ionization
edge, after subtraction. Edge quantification using a hydrogenic K-shell cross section
led to an estimate of the He concentration in a 20-nm bubble: 2 × 10²⁸ atoms/m³,
corresponding to a He pressure of 2 kbar (0.2 GPa). More recently, Fréchard et al. 
(2009) performed a systematic study of He bubbles in martensitic steel. The Fe 
plasmon peak was modeled as a Gaussian and subtracted to yield a Gaussian-like 
helium signal (Fig. 5.25), which was quantified using Hartree-Slater cross sections.
For bubble diameters less than 5 nm, the He density matched that of liquid He
and was three times as high for a 2-nm bubble. The helium peak blue-shifted by
about 1 eV with decreasing bubble diameter, supposedly a result of Pauli repulsion
(wavefunction overlap between adjacent atoms). 

Using energy-loss spectroscopy with a broad electron beam, Fink (1989) esti- 
mated the average pressure in He bubbles formed in Al and Ni by ion implantation. 
The excitation threshold was shifted from the free-atom value (21.23 eV) to about 
24 eV. Taking this blueshift to be proportional to He density, the He pressure P was 
found to be inversely proportional to the bubble radius r (Pr = 90 kbar nm) for 
bubbles in aluminum. In the case of implanted Ni, the pressure inside the smallest 
bubbles exceeded 250 kbar, corresponding to a density 10 times larger than liquid 
He, so the helium was assumed to be in solid form. Confirmation by electron diffrac- 
tion was hampered by the small scattering amplitude of He, but electron diffraction 
peaks have been recorded from bubbles of other rare gases ion-implanted into Al 
and have indicated epitaxy with the surrounding matrix. 

Fig. 5.25 (a) Energy-loss spectrum recorded from the center of a He bubble in EM10 martensitic
steel, showing the plasmon-fitting windows. (b) Subtracted He signal and Gaussian fit to the
data. From Fréchard et al. (2009), copyright Elsevier


5.4.2 Measurement of Lithium, Beryllium, and Boron 


The elements Li, Be, and B give K-ionization edges in the 30-200 eV region,
superimposed on a relatively large background. In very thin specimens (t/λ < 0.3), this
background represents the tail of the valence electron plasmon peak; in thicker ones,
plural plasmon scattering dominates. Hofer and Kothleitner (1993) used Fourier
log deconvolution to improve an $AE^{-r}$ fit to the background preceding Li and Be
edges recorded from mineral specimens. In this energy region, a K-edge is likely
to overlap with L- or M-edges of other elements, complicating quantitative
analysis. The problem is reduced by using a small energy window (Δ < 50 eV), at the
risk of systematic error due to energy-loss near-edge structure (Section 4.5.2). A 
more satisfactory solution for Li and Be edges (Hunt and Williams, 1991; Hofer 
and Kothleitner, 1993) is to employ MLS fitting to reference edges recorded from 
simple compounds that have the same coordination and similar near-edge structure. 

Lithium cannot be analyzed with current EDX detectors and is measurable by 
WDX spectroscopy only by applying a considerable electron dose, with potential
radiation damage. Chan and Williams (1985) evaluated EELS as a means of
quantitative analysis of Al/Li alloys, used in aerospace applications because of
their high strength/weight ratio. To minimize the Li K-edge background, very thin
(<50 nm) specimens and small collection angles (β < 5 mrad) were necessary.
The pre-edge background could then be successfully modeled by an $AE^{-r}$ function
and extrapolated over a 40-eV interval containing both the Li K- and Al L23
edges.

The use of lithium as a battery material has led to EELS being used, along with 
TEM imaging and diffraction, to characterize the various phases involved (Muto 
et al., 2009; Wang et al., 2010b). The main problems for Li quantification are 
double-plasmon excitation, giving a low edge/background intensity ratio at the Li
K-edge, and the sensitivity of this element to the electron beam. Radiation damage
occurs through radiolysis and also displacement damage, since the incident-beam
threshold energy is normally below 20 keV. Strong electron-hole interaction and
relativistic effects (in an anisotropic matrix, see Appendix A) complicate fine structure
analysis of the edge. An incident energy of 200 keV or more helps by increasing the
inelastic mean free path, equivalent to a thinner specimen. The Li-K near-edge
structure can be an effective tool for differentiating between lithium compounds such as
LiC6, Li2CO3, and Li2O and for comparing measured ELNES with calculations
based on an assumed structure; see Fig. 5.26. 

Beryllium forms coherent precipitates in copper, giving high strength through 
age-hardening. Using EELS, Strutt and Williams (1993) found that the Be/Cu ratio 
of γ-phase precipitates increased with decreasing aging temperature, contrary to
the expected phase diagram. The lowest quantified concentration of Be in the pre- 
cipitates was about 10 at.%. In their analysis of BeO-doped SiC, Liu et al. (1991) 
avoided conventional background fitting by recording a spectrum from pure SiC. 
After scaling and subtracting this spectrum, weak features at 188 eV indicated a 
beryllium content considerably less than 1%. EELS has also been employed to 
detect 10-nm Be grains in lung tissue (Jouffrey et al., 1978). 

Fig. 5.26 (a) Lithium K-edge structure of fully lithiated graphite (LiC6), metallic Li, and several
other compounds. (b) Measured Li-K spectrum of Li-intercalated graphite, compared with FEFF-9.05
calculations using the final state rule (FSR) approximation, with and without inclusion of core
hole (CH) and relativistic effects. From Wang et al. (2010b), copyright American Chemical Society


Boron is easier to quantify due to its higher K-edge energy (188 eV). Using a 
50-nm-diameter probe in a field-emission STEM and second-difference recording, 
Leapman (1992) detected 1% of boron in silicon. The edge is partially obscured 
by EXELFS modulations from the preceding Si L23 edge. Boride particles (3-5 nm
diameter) in silicon have been identified in CTEM core-loss images, confirming a 
previous HREM interpretation based on the coherency of SiB precipitates (Frabboni 
et al., 1991). 

Boothroyd et al. (1990) detected 0.5 at.% B in Ni3Al from parallel EELS data 
in which gain variations were reduced to 0.005% by use of iterative averaging 
(Section 2.5.5). A second-difference filter was applied to suppress EXELFS modu- 
lations from the preceding Ni M- and Al L-edges. Good energy resolution helps in 
distinguishing the intrinsically sharp boron edge from the more gradual EXELFS 
modulations. 
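
The effect of a second-difference filter on sharp and gradual features can be illustrated numerically. The sketch below applies the filter to an already recorded spectrum; the sign convention and the shift in channels are assumptions, and a digital filter of this kind does not by itself remove detector gain variations in the way that shifted hardware acquisition does.

```python
def second_difference(counts, shift):
    """Return J(E - shift) - 2*J(E) + J(E + shift) for each interior channel."""
    return [counts[i - shift] - 2.0 * counts[i] + counts[i + shift]
            for i in range(shift, len(counts) - shift)]

# A linearly varying background is removed exactly ...
linear = [1000.0 - 2.0 * i for i in range(50)]
sd_linear = second_difference(linear, 3)

# ... whereas a sharp 100-count edge at channel 25 survives as a bipolar feature.
edge = [value + (100.0 if i >= 25 else 0.0) for i, value in enumerate(linear)]
sd_edge = second_difference(edge, 3)
```

Slowly varying modulations such as EXELFS are strongly attenuated because they are nearly linear over the shift interval, while an abrupt edge is preserved at close to its full height.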

Boron-containing compounds have been investigated for use in neutron capture 
cancer therapy (BNCT). Bendayan et al. (1989) used electron spectroscopic imaging 
(ESI) to show that B-containing biopolymeric conjugates are absorbed intracel- 
lularly by colorectal cells. As a preliminary to BNCT studies, Zhu et al. (2001) 
analyzed data recorded from B/C test specimens, finding that concentrations down 
to 0.2% could be measured with 10% accuracy by conventional background fitting 
but with fitting windows before and after the boron K-edge, so that the background 
was derived by interpolation; see Section 4.4.1. 


5.4.3 Measurement of Carbon, Nitrogen, and Oxygen 


The absence of beam-induced hydrocarbon contamination is an obvious require- 
ment for the unambiguous identification and measurement of carbon. Microscope- 
induced contamination is reduced by using oil-free pumping and a liquid nitrogen 
trap or cold finger, giving a low partial pressure of hydrocarbons in the vicinity of the
specimen. Specimen-borne contamination can be minimized by careful attention to 
cleanliness during specimen preparation and by liquid nitrogen cooling of the spec- 
imen during microscopy, which reduces the mobility of hydrocarbon molecules on 
the specimen surface. Surface hydrocarbons are desorbed or rendered immobile (by 
polymerization) through mild baking of a specimen, either inside the microscope or 
before insertion, or by withdrawing the TEM condenser aperture and defocusing the 
illumination in order to strongly irradiate regions surrounding those to be analyzed. 
A more recent technique is to oxidize the carbon with oxygen radicals generated by 
a plasma source attached to the side of the TEM column (Horiuchi et al., 2009). 

The identification of carbide and nitride precipitates in steel has been an impor- 
tant application of core-loss spectroscopy. Figure 5.27 illustrates how TiC and TiN 
precipitates can be more easily distinguished from K-loss spectra than from their 
morphology or diffraction pattern. Atomic ratios of transition metals within car- 
bides have been estimated from the appropriate L-ionization edges (Fraser, 1978; 
Baker et al., 1982). 

Extraction replicas can be used to isolate small particles for spectroscopy, but the
usual replicating materials (carbon or polymers) complicate the analysis of carbon.
Garratt-Reed (1981) used a 50-nm coating of evaporated aluminum for extraction
and showed that the carbon content of presumed V(C, N) precipitates in vanadium
HSLA steel was less than his detection limit, then about 5 at.%. Silicon extraction
replicas, made by RF sputtering in argon, have also been used (Duckworth et al.,
1984). One potential problem is the loss of carbon from carbide precipitates when
irradiated by a small probe. However, VC precipitates down to 1 nm diameter have
been analyzed with parallel-recording EELS, allowing changes in composition to be
monitored during their growth sequence (Craven et al., 1989). More recently, Craven
et al. (2008) used EELS spectrum imaging and field evaporation atom probe
techniques to study particles in high-strength low-alloy steel. These particles were found
to be nitrogen-rich vanadium and chromium carbonitrides surrounded by an
atmosphere of segregated atoms. The authors remark that the use of a dual-readout EELS
system, giving core-loss and low-loss spectra at each pixel, would have allowed the
vanadium content of each particle to be determined.

Fig. 5.27 Micrographs and core-loss spectra of (a) TiN and (b) TiC precipitates within a Ti-rich
phase in stainless steel. From Zaluzec (1980), copyright Claitor’s Publishing, Baton Rouge, LA

Energy-loss spectroscopy of various forms of elemental carbon (C60, nanotubes,
graphene, etc.) is discussed in Section 5.7.3. 

Nitrogen in solution within the γ-phase of duplex stainless steel was measured
by Yamada et al. (1992) as 0.26 ± 0.04 wt.%. They used a 120-kV TEM in
diffraction mode and 15-mrad collection semi-angle. The nitrogen content of the α-phase
was at or below the detection limit, about 0.2 at.%. This low limit was achieved
by iterative averaging of the spectra and by employing a top-hat filter to give 
second-differential spectra. Quantification involved the use of narrow (2.5-eV) inte- 
gration windows around the N and Fe second-differential peaks, together with a 
calibration curve of N/Cr intensity ratio against N/Cr concentration, measured using 
high-nitrogen alloys. 

Some types of natural diamond contain octahedral-faceted inclusions a few 
nanometers in size, known as voidites. EELS measurements reveal a sharply peaked 
K-edge at 400 eV, indicating the presence of nitrogen. Analysis of 20 voidites 
(Bruley and Brown, 1989) gave an average nitrogen concentration about half the 
carbon concentration and independent of voidite size; see also Fig. 5.64. The shape 
of the nitrogen K-edge was consistent with the presence of N2 rather than NH3 
(proposed as an explanation for previous lattice images). Bruley (1992) found that 
nitrogen may be present at platelet defects in diamond, but only at a level of the 
order of a tenth of a monolayer. 

Oxygen can also occur in voidites, including those present in interplanetary dust 
particles. Erni et al. (2005) found their nanometer-sized vesicles to contain
molecular oxygen (and a small fraction of H2O) as evidenced by the appearance of a π*
peak at 531 eV; see Fig. 5.28. After subtraction of the K-edge of the matrix, the
difference spectrum showed close agreement with that of O2 gas available from a
core-loss database (footnote 2); see Fig. 5.28b.

Using a field-emission STEM and serial-recording spectrometer, Bourret and 
Colliex (1982) reported evidence for the segregation of oxygen at dislocation cores 
in germanium. Background fitting to the oxygen K-edge and extrapolation over 
100 eV revealed an oxygen signal ~1% of the background. During subsequent 
HREM imaging of the dislocations, oxygen was removed by the electron irradiation, 
with a characteristic dose estimated to be 10⁴ C/cm². Subsequent studies have taken
advantage of the improved collection efficiency of a parallel-recording spectrometer 
and second-difference techniques (Yamada et al., 1992). 


2 A. Hitchcock and colleagues: http://unicorn.mcmaster.ca/corex/name-list.html

Fig. 5.28 (a) Oxygen K-edge recorded by monochromated EELS from a voidite and from the
surrounding silicate matrix of an interplanetary dust particle, collected from the stratosphere. (b)
Difference spectrum from (a) compared with the core-loss spectrum recorded from O2 gas, both
spectra showing a π* peak at 531 eV and σ* peaks at 539 and 542 eV. From Erni et al. (2005),
copyright Elsevier


Disko et al. (1991) utilized the fact that the Al-L23 threshold in Al2O3 is shifted
upward in energy by 4.5 eV (relative to Al metal) to distinguish regions of Al-Al and
Al-O bonding in oxide-strengthened aluminum alloys formed by cryomilling in
liquid nitrogen. Their procedure provided an indication of the oxide/metal fraction in
different regions. The spectrometer was also used to acquire pairs of core-loss
spectra shifted by 7 eV; a peak at 400 eV in the ratio (log-derivative spectrum) provided
a qualitative but sensitive test for small percentages of nitrogen. Nitrogen
quantification was achieved by careful $AE^{-r}$ background fitting and gave N/O ratio ~ 1 in
some of the particles. 

Oxides and nitrides of silicon have been analyzed by examining the shape of 
the Si L-edge. By comparing the fine structure with that recorded from several
candidates, Skiff et al. (1986) showed that oxygen precipitation in Si produces SiO,
not SiO2. They also established that precipitates in damaged regions of N+
ion-implanted Si were Si3N4. Using the Si L- and O K-edges of SiO2 as standards, an
analysis of semi-insulating polycrystalline oxygen-doped silicon (SIPOS) revealed 
only 15 at.% of oxygen (Catalano et al., 1993), but after annealing at 900°C the 
material decomposed into silicon nanocrystals and a matrix containing 36 at.% oxy- 
gen. Kim and Carpenter (1990) have shown that the native oxide formed on silicon 
at room temperature has a composition close to SiO, suggesting that metastable 
amorphous solid solutions of Si and O can exist as a single phase over the whole 
range from Si to SiO2. 


5.4.4 Measurement of Fluorine and Heavier Elements 


Some fluorinated organic compounds have normal biological activity and can be 
used as molecular markers for specific sites in cells. By forming energy-selected 
images with the fluorine K-edge, the segregation of (difluoro)serotonin was
demonstrated (Costa et al., 1978). Fortunately, compounds in which fluorine is directly
attached to an aromatic ring are relatively stable under electron irradiation,
sometimes withstanding doses as high as 10⁴ C/cm² if the specimen is cooled to -160°C
(Ciliax et al., 1993). 

The distribution of sulfur, phosphorus, and calcium is of interest in biological 
systems but the dose required for mapping these elements by x-ray K-emission spec- 
troscopy is often destructive. EELS offers the option of using L-shell ionization, for 
which the cross section is relatively large, allowing higher detection sensitivity (see 
later, Fig. 5.32). Because of the low L-edge energies of P and S (135 and 165 eV), the
spectral background is high, even for very thin specimens. This background can be 
suppressed by using first- or second-difference techniques (Section 4.4.5), the sensi- 
tivity then depending on the noise level and to some extent on the energy resolution. 
Detection of Ca, Ti, and transition elements is helped by the fact that these ele- 
ments display sharp white-line peaks at the ionization threshold (Fig. 3.45). These 
peaks become amplified in difference spectra, so that less than 100 ppm of alkaline 
earths, transition metals, and lanthanides could be detected in a glass test specimen 
(Leapman and Newbury, 1993). 

Based on calculations and experimental results, Wang et al. (1992) prescribed 
optimum conditions for the detection of phosphorus in biological tissue with a 
parallel-recording spectrometer: t/λ ≈ 0.3 and a 15-eV shift between spectra if
first-difference recording is used. At that time, EELS was estimated to be 15 times
more sensitive than EDX spectroscopy: with 0.5-nA beam current and 100-s
recording time, the minimum detectable concentration was calculated as 8.4 mmol/kg
(100 ppm), equivalent to 34 phosphorus atoms in a 15-nm probe. 

Electron spectroscopic imaging (ESI) in a conventional TEM has been used 
extensively to provide qualitative or semiquantitative elemental maps of
biological specimens. Since phosphorus is a constituent of DNA, P-L23 images have been
employed to investigate DNA configurations within 80S ribosomes (Shuman et al.,
1982) and chromatin nucleosomes (Ottensmeyer, 1984). The latter contain about
300 atoms of phosphorus and the signal/noise ratio of the corresponding phosphorus
signal was about 30. Köpf-Maier (1990) employed 80-keV energy-selected
imaging to analyze the distribution of titanium and phosphorus in human tumors as a
function of time after therapeutic doses of titanocene dichloride. The maximum Ti 
concentration in cell nuclei and nucleoli occurred after 48 h and was accompanied 
by an enrichment of phosphorus, confirming that the primary interaction occurs with 
nucleic acids, particularly DNA. 

ESI has also been used to image the distribution of heavier elements such as tho- 
rium, cerium, and barium (formed as cytochemical reaction products) in order to 
detect enzyme activities within a cell (Sorber et al., 1990). In some cases, K-edges 
were used, e.g., to detect aluminum in newt larvae (Böhmer and Rahmann, 1990).
Here the advantages of EELS over EDX spectroscopy are less obvious. However,
edges in the 1000-3000 eV region can have good signal/background ratios,
relatively unaffected by plural scattering, so specimens can be as thick as 0.5 μm
(Egerton et al., 1991). 


5.5 Spatial Resolution and Detection Limits 


The spatial resolution obtainable in energy-loss spectroscopy or energy-filtered 
imaging depends on several factors, now to be discussed. In the case of core-loss 
microanalysis, spatial resolution is closely connected with the concept of elemental 
detection limits, as explained in Section 5.5.4. 


5.5.1 Electron-Optical Considerations 


In a scanning transmission electron microscope (STEM), the spatial resolution 
of the image (or of point analysis) depends on the diameter d of the incident 
probe. The latter can be made small by demagnifying the source with condenser 
lenses and probe-forming lenses but the electron optical brightness B (i.e., cur- 
rent/area/steradian) at the plane of focus remains the same as at the source. Source 
brightness is roughly proportional to the accelerating voltage; for V0 = 100 kV,
B ≈ 3 × 10’ A m⁻² sr⁻¹ for a Schottky source and ~10! A m⁻² sr⁻¹ for a
cold field-emission source. The smallest obtainable probe diameter can therefore be 
written as 


$$d = \frac{2\,I^{1/2}}{\pi\,\alpha\,B^{1/2}} \qquad (5.16)$$


where I is the probe current and α the probe convergence semi-angle. As seen from
Eq. (5.16), the product (dα) of beam diameter and angular spread is constant within
the optical system. Large demagnification gives small d but large α, resulting in
spherical aberration tails (extending to a radius ≈ C_s α³) unless an aberration
corrector is used. A further result of large α is a small depth of focus: if the probe is
focused at the mid-plane of a specimen of thickness t, its geometrical diameter is αt
at the beam entrance and exit surfaces.
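
The brightness-limited probe size of Eq. (5.16) and the geometrical defocus spread can be evaluated with a short sketch. It takes brightness as current per unit area per unit solid angle, which gives d = 2·I^(1/2)/(π·α·B^(1/2)). The brightness, current, and convergence angle below are assumed round numbers for illustration, not values quoted in the text.

```python
import math

def probe_diameter_m(current_a, alpha_rad, brightness):
    """Brightness-limited probe diameter (m); brightness in A m^-2 sr^-1."""
    return 2.0 * math.sqrt(current_a) / (math.pi * alpha_rad * math.sqrt(brightness))

def surface_diameter_nm(alpha_rad, thickness_nm):
    """Geometrical beam diameter at the entrance and exit surfaces
    when the probe is focused at the specimen mid-plane."""
    return alpha_rad * thickness_nm

# Assumed conditions: 1 nA probe, 10 mrad semi-angle, B = 1e13 A m^-2 sr^-1.
d = probe_diameter_m(1.0e-9, 10.0e-3, 1.0e13)   # about 0.64 nm
spread = surface_diameter_nm(10.0e-3, 50.0)     # 0.5 nm for a 50-nm foil
```

Halving α doubles d for the same current, which is the constancy of the product dα noted above; aberrations and diffraction are not included in this estimate.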

In a thermionic source TEM, a sub-nanometer probe can be formed but with 
relatively low current (a few picoamperes). The current—density profile contains a 
sharp central peak surrounded by electron-beam tails that contain an appreciable 
fraction of the incident current and may extend for many nanometers (Cliff and 
Kenway, 1982). Deconvolution techniques have been used to correct concentration 
profiles for the effect of these aberration tails, based on a measured or calculated 
incident beam profile (Thomas, 1982; Weiss and Carpenter, 1992). 

If a TEM is operated in broad-beam imaging mode, a small region of specimen 
can be chosen for EELS by a selected-area diffraction aperture or (in TEM imaging 
mode) the spectrometer entrance aperture. However, the imaging lenses suffer from 
both spherical and chromatic aberration, the latter being severe at high energy loss 
(Section 2.3.2) unless the high voltage is raised by an equal amount or the objec- 
tive lens refocused (Schenner and Schattschneider, 1994). Spherical aberration can 
seriously degrade the resolution in thicker specimens, but can be limited by means 
of an objective aperture or eliminated by a post-specimen aberration corrector. The 
spatial resolution of an energy-filtered image may also be limited by the pixel size
of the recording camera, if the image magnification is not sufficiently high. 


5.5.2 Loss of Resolution Due to Elastic Scattering 


When an electron beam enters a specimen, it spreads laterally. Most of this spread- 
ing comes from elastic scattering, whose average deflection angle is larger than 
that of the inelastic scattering (Fig. 3.7). As depicted in Fig. 1.12, beam
broadening degrades the spatial resolution of x-ray emission spectroscopy; simple models
suggest a broadening proportional to $t^{3/2}$, amounting to 10 nm or more for a
100-nm-thick foil and 100-keV incident electrons (Goldstein et al., 1977). 

In the case of EELS, the angular divergence of the beam entering the
spectrometer can be limited to some chosen value β by means of a collection aperture. This
aperture also tends to exclude electrons present in incident probe aberration tails 
and stray electrons produced in the TEM illumination system. The EELS signal is 
unaffected by secondary electrons generated by the electron probe, which can pro- 
duce x-rays away from the incident beam. These various factors imply somewhat 
better spatial resolution than is obtainable from EDX spectroscopy, as confirmed 
experimentally (Collett et al., 1984; Titchmarsh, 1989; Geng et al., 2009). 

For an amorphous specimen, the collimation effect of the angle-limiting aperture
can be estimated from simple geometry, as shown in Fig. 5.29a. If scattering occurs
at the top of the foil, the volume of specimen sampled by the recorded electrons
(contained within a cone of semi-angle β) is largest; if it occurs at the bottom, this
volume is zero. Averaging over the thickness of the specimen, the fraction $F_e(r)$ of
electrons that have traveled radial distances up to r within the specimen is given
by Fig. 5.29b. Almost 90% of the electrons collected by the aperture are contained
within a specimen volume of diameter 2r ≈ βt, which is below 1 nm for t = 50 nm
and β < 20 mrad.

Fig. 5.29 (a) Beam broadening due to scattering at a point P, a distance z from the exit surface
of a specimen. The shaded area represents the excited volume that lies within a distance r of the
optic axis and gives rise to inelastic scattering within the collection aperture. (b) Fraction $F_e(r)$ of
the elastically scattered electrons (scattering angle < β) contained within a radius r of the incident
beam axis. This estimate assumes that the angular width of elastic scattering is large compared to
that of inelastic scattering and large compared to the collection semi-angle β
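
The geometrical estimate of the sampled diameter is simple enough to tabulate. The sketch below assumes only the relation 2r ≈ βt quoted in the text; the function name and the list of thicknesses are illustrative.

```python
def collimated_diameter_nm(beta_mrad, thickness_nm):
    """Approximate diameter (nm) containing most of the collected electrons."""
    return beta_mrad * 1.0e-3 * thickness_nm

# Sampled diameter for a 20-mrad collection semi-angle and several foil thicknesses.
table = {t: collimated_diameter_nm(20.0, t) for t in (25.0, 50.0, 100.0)}
```

For the 50-nm case the result is 1 nm, the upper bound stated above; a smaller collection semi-angle reduces the diameter in proportion.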

In the case of a crystalline specimen, the collection angle may be chosen to 
exclude Bragg beams, suggesting a radial spread (less than βt) that arises mainly
from inelastic scattering and incident beam convergence. Even in the absence of 
an aperture, electron channeling can reduce the radial broadening (Browning and 
Pennycook, 1993). In effect, beam broadening is delayed up to a depth (below the 
entrance surface) at which s-type Bloch waves (more localized at the atomic cen- 
ters) are dispersed by inelastic scattering. STEM imaging of atomic columns in a 
thin crystal relies on this principle (Pennycook and Jesson, 1991). A practical way 
of minimizing beam spreading is to orient the specimen so as to avoid strongly 
diffracting conditions. 


5.5.3 Delocalization of Inelastic Scattering 


In Section 3.11, we discussed the delocalization of inelastic scattering as a conse- 
quence of the long-range nature of the electrostatic interaction between an incident 
electron and the atomic electrons in a solid. Delocalization was represented by a 
point-spread or object function, of width inversely related to the angular width of 
inelastic scattering. Here we attempt to represent the delocalization in terms of a 
single number, in order to roughly estimate its contribution to the spatial resolution 
of EELS or EFTEM imaging and show how it depends on experimental parameters 
such as energy loss, incident energy, and collection angle. 

A common wave-optical measure of resolution is the Rayleigh diffraction limit: 
Ax = 0.6A/sinB ~ 0.64/86 where B (<<l1 rad) is the aperture semi-angle of the 
optical system used to form an image. This formula applies to a situation where 
scattering uniformly fills the aperture, whereas the angular width of inelastic scat- 
tering is relatively small. In the absence of any angle-limiting aperture, half of the 
inelastic scattering is contained within the median scattering angle $\langle\theta\rangle$, and for a 
Lorentzian angular distribution with a cutoff at $\theta_c$, $\langle\theta\rangle \approx (\theta_E \theta_c)^{1/2}$ (Section 3.3). 
Taking $\theta_c$ as the Bethe-ridge angle $(2\theta_E)^{1/2}$ gives $\langle\theta\rangle \approx 1.2\,\theta_E^{3/4}$, and the object 
width containing 50% of the scattered electrons can be estimated as 


$$ L_{50} \approx 0.6\lambda/\langle\theta\rangle \approx 0.5\lambda/\theta_E^{3/4} \qquad (5.17) $$


If the imaging system contains an aperture (semi-angle $\beta$), we can roughly estimate 
its effect by combining the lateral broadenings in quadrature: 


$$ (d_{50})^2 \approx (0.5\lambda/\theta_E^{3/4})^2 + (0.6\lambda/\beta)^2 \qquad (5.18) $$ 


Fig. 5.30 Delocalization distance for inelastic scattering, calculated for 100-keV electrons from 
Eq. (5.17) with a free-electron plasmon cutoff (solid line) and a Bethe ridge cutoff (dashed line). 
The dash-dot line represents Eq. (5.18) with $\beta$ = 10 mrad. Data points represent TEM-EELS 
measurements and calculations (see text) 


Values of $L_{50}$ and $d_{50}$ for $E_0$ = 100 keV and $\beta$ = 10 mrad are shown in Fig. 5.30. 
The main conclusion is that the localization width decreases with increasing energy 
loss, from a few nanometers at low energy loss to subatomic dimensions in the case 
of higher energy ionization edges. 
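
As a rough numerical illustration of Eqs. (5.17) and (5.18), the following Python sketch evaluates $L_{50}$ and $d_{50}$ for 100-keV electrons. The function names, the rounded physical constants, and the relativistic wavelength expression $\lambda = hc/[E_0(E_0 + 2m_0c^2)]^{1/2}$ are assumptions introduced here for the example; the characteristic angle is taken as $\theta_E = E/(\gamma m_0 v^2)$ and the Bethe-ridge cutoff is assumed throughout.

```python
import math

M0C2_KEV = 510.999    # electron rest energy (keV)
HC_KEV_NM = 1.239842  # Planck constant times speed of light (keV nm)


def wavelength_nm(e0_kev):
    """Relativistic electron wavelength (nm) for incident energy E0 (keV)."""
    return HC_KEV_NM / math.sqrt(e0_kev * (e0_kev + 2.0 * M0C2_KEV))


def theta_e(e_loss_ev, e0_kev):
    """Characteristic angle theta_E = E / (gamma m0 v^2), in radians."""
    gamma = 1.0 + e0_kev / M0C2_KEV
    v2_over_c2 = 1.0 - 1.0 / gamma**2
    return (e_loss_ev / 1000.0) / (gamma * M0C2_KEV * v2_over_c2)


def l50_nm(e_loss_ev, e0_kev=100.0):
    """Eq. (5.17) with a Bethe-ridge cutoff: L50 ~ 0.5 lambda / theta_E^(3/4)."""
    return 0.5 * wavelength_nm(e0_kev) / theta_e(e_loss_ev, e0_kev) ** 0.75


def d50_nm(e_loss_ev, e0_kev=100.0, beta_rad=0.010):
    """Eq. (5.18): add the aperture term 0.6 lambda / beta in quadrature."""
    aperture_term = 0.6 * wavelength_nm(e0_kev) / beta_rad
    return math.hypot(l50_nm(e_loss_ev, e0_kev), aperture_term)


if __name__ == "__main__":
    for e_loss in (10, 30, 100, 300, 1000):
        print(f"E = {e_loss:5d} eV  L50 = {l50_nm(e_loss):.3f} nm  "
              f"d50 = {d50_nm(e_loss):.3f} nm")
```

Under these assumptions $L_{50}$ falls from roughly 3 nm at 10 eV to roughly 0.1 nm at 1000 eV, whereas $d_{50}$ levels off near the aperture term $0.6\lambda/\beta \approx 0.22$ nm at high energy loss.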

Figure 5.30 also includes experimental estimates of delocalization. Triangles 
(Muller and Silcox, 1995a) represent measurements of inelastic intensity as a func- 
tion of distance away from the edge of a 3-nm amorphous carbon film. Inverted 
triangles (van Benthem et al., 2001) are based on STEM measurements of a 
grain boundary in SrTiO3. The two square data points (Shuman et al., 1986) are 
based on the measured angular distribution of scattering from carbon and ura- 
nium in stained catalase crystals. Hollow circles represent STEM measurements 
on LaMnO3/SrTiO3 superlattices (Shao et al. 2011). The hexagonal data point of 
Adamson-Sharpe and Ottensmeyer (1981) is based on EFTEM phosphorus map- 
ping, while the data point due to Mory et al. (1991) is derived from a statistical 
analysis of STEM images of uranium atoms, recorded with O45 loss electrons. 
Clearly the experimental conditions differ among these measurements, accounting 
for some of the scatter in the data. Values in Fig. 5.30 have been adjusted to 100- 
keV incident energy, based on Eq. (5.17), although this equation predicts little $E_0$ 
dependence (see Fig. 5.31) because changes in $\lambda$ and $\theta_E^{3/4}$ largely cancel. 

Inelastic scattering distributions can also be calculated from first principles. Kohl 
and Rose (1985) employed quantum-mechanical theory to calculate intensity pro- 
files for the inelastic image of a single atom, using a dipole approximation. Schenner 
and Schattschneider (1994) extended the method to include the effects of spheri- 
cal aberration, chromatic aberration, and objective lens defocus, while Muller and 
Silcox (1995a) investigated the effect of different detector geometries. Cosgriff et al. 
(2005) calculated core excitation STEM images of single atoms using Hartree-Fock 
wavefunctions. Xin et al. (2010) reported more recent quantum-mechanical cal- 
culations within the dipole approximation. Data from all of these calculations are 
included in Fig. 5.30. 

The solid line in Fig. 5.30 again represents Eq. (5.17) but with an alternative 
choice of cutoff angle, based on Eq. (3.50) and more appropriate to plasmon losses. 
Note that inelastic delocalization refers to modulations in plasmon signal arising 
from changes in chemical composition. Particularly in thicker specimens, variations 
in the amount of plural (elastic + plasmon) scattering can introduce diffraction con- 
trast into the plasmon-loss image and this contrast has high resolution because of 
the more localized nature of elastic scattering (Egerton, 1976c; Craven and Colliex, 
1977). 

Elastic scattering also complicates the situation for core losses, especially in crys- 
talline specimens. Spence and Lynch (1982) calculated the Be K-loss intensity for a 
single Be atom on a 7.2-nm gold film, where the delocalization length exceeds the 
substrate lattice spacing, resulting in lattice fringe modulation of the K-loss profile 
(representing Bragg scattering of Be K-loss electrons in gold). These simulations 
showed that atoms not selected by the energy filter may appear in an atomic resolu- 
tion inelastic image if the selected energy loss is not sufficiently high. The absence 
of boron atoms in atomic resolution LaB6 energy-selected images was similarly 
explained as arising from strong elastic scattering by La columns onto adjacent B 
columns, creating a boron signal with little contrast (Lazar et al., 2010). The com- 
bined effects of elastic and inelastic scattering in crystalline specimens have been 
discussed by Schattschneider et al. (1999, 2000); see Section 3.11. 

Kimoto and Matsui (2003) used the focus dependence of energy-filtered lat- 
tice fringe contrast to measure a lateral coherence length associated with inelastic 
scattering, obtaining values similar to those given by Eq. (5.17). 

In addition to its relevance to energy-filtered imaging and spectroscopy, delo- 
calization of inelastic scattering is of importance in channeling experiments 
(Section 5.6.1). Here core-loss scattering is recorded in a diffraction plane and the 
aperture term of Eq. (5.18) is absent, so the localization distance has subatomic 
dimensions at high energy loss. This situation allows the nonuniformity in cur- 
rent density that arises from channeling (Section 3.1.4) to appreciably affect the 
core-loss intensity, dependent on the specimen orientation. Bourdillon et al. (1981) 
and Bourdillon (1984) fitted their measured orientation effects with a localiza- 
tion distance $\lambda/(4\pi\theta_E)$ deduced from time-dependent perturbation theory (Seaton, 
1962). 

One advantage of the simple treatment represented by Eq. (5.17) is that it pro- 
vides an estimate of delocalization in cases where the physical properties of the 
specimen are not all known and more sophisticated calculations are not possible. 
The formula also suggests how the delocalization distance might vary with the 
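incident energy $E_0$. Taking $\theta_E = E/(\gamma m_0 v^2)$ gives 


$$ L_{50} \propto \lambda/\theta_E^{3/4} \propto \gamma^{-1} v^{-1} (\gamma^{3/4} v^{3/2}) = \gamma^{-1/4} v^{1/2} \qquad (5.19) $$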


Above 100 keV, this variation barely exceeds 10%; for $E_0$ = 30 keV the delocaliza- 
tion width is about 20% smaller than at 100 keV; see Fig. 5.31. 
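
The size of this incident-energy dependence can be checked numerically. The short Python sketch below evaluates the factor $\gamma^{-1/4} v^{1/2}$ of Eq. (5.19) relative to its value at 100 keV; the function names and the rounded rest energy are assumptions made for this example.

```python
import math

M0C2_KEV = 510.999  # electron rest energy (keV)


def l50_scale(e0_kev):
    """Factor gamma^(-1/4) v^(1/2) of Eq. (5.19), with v in units of c."""
    gamma = 1.0 + e0_kev / M0C2_KEV
    v_over_c = math.sqrt(1.0 - 1.0 / gamma**2)
    return gamma ** -0.25 * math.sqrt(v_over_c)


def l50_relative(e0_kev, ref_kev=100.0):
    """Delocalization width at E0 relative to its value at ref_kev."""
    return l50_scale(e0_kev) / l50_scale(ref_kev)


if __name__ == "__main__":
    for e0 in (30, 60, 100, 200, 300):
        print(f"E0 = {e0:3d} keV  L50/L50(100 keV) = {l50_relative(e0):.3f}")
```

The ratio comes out close to 0.80 at 30 keV and about 1.11 at 300 keV, consistent with the figures quoted above.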


5.5.4 Statistical Limitations and Radiation Damage 


Because inner-shell ionization cross sections are relatively low, the core-loss inten- 
sity may represent a rather limited number of scattered electrons. The spatial 
resolution, detection limits, and accuracy of elemental microanalysis are then 
strongly influenced by statistical considerations. In the analysis below, we evalu- 
ate these statistical constraints on the detection of a small quantity ($N$ atoms per unit 
area) of an element that is present in a matrix (or on a support film) having an areal 
density of $N_t$ atoms per unit area, where $N_t \gg N$. For convenience of notation, 
all intensities are assumed to represent numbers of recorded electrons. However, the 
radiation fluence or dose D received by the specimen (during spectrum acquisition 
time T) is in units of Coulombs per unit area. 

According to Eq. (4.65), the core-loss signal, recorded with a collection semi- 
angle $\beta$ and integrated over an energy window $\Delta$, is given by 


$$ I_k \approx N \, I(\beta, \Delta) \, \sigma_k(\beta, \Delta) \qquad (5.20) $$


where $\sigma_k(\beta, \Delta)$ is the appropriate core-loss cross section and $I(\beta, \Delta)$ is the intensity 
in the low-loss region, integrated up to an energy loss $\Delta$. If the energy window 
$\Delta$ contains most of the electrons transmitted through the collection aperture but 
this aperture is small enough to exclude most of the elastic scattering (mean free 
path $\lambda_e$), 


$$ I(\beta, \Delta) \approx (I/e)\, T \exp(-t/\lambda_e) = (\pi/4)\, d^2 (D/e) \exp(-t/\lambda_e) \qquad (5.21) $$ 


where $I$ is the probe current (in amperes), $e$ the electronic charge, and $d$ the probe diam- 
eter. We express $I(\beta, \Delta)$ in terms of an electron dose (or fluence) $D$ because, in the 
absence of instrumental drift, radiation damage provides the fundamental limit to 
acquisition time and is certainly the main practical limitation for organic specimens. 
For small $\Delta$, an additional factor of $\exp(-t/\lambda_i)$ would be required in Eq. (5.21), $\lambda_i$ 
being a mean free path for inelastic scattering (Leapman, 1992). 
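
The conversion between probe current, acquisition time, and dose implied by Eq. (5.21) can be sketched in a few lines of Python. The function names are hypothetical, the probe is assumed to illuminate a disk of diameter $d$ uniformly, and the numerical inputs (650 pA, 0.5 nm, 35 ms, a 30-nm specimen with an elastic mean free path of 200 nm) are illustrative values only.

```python
import math

E_CHARGE = 1.602176634e-19  # electronic charge (C)


def dose_c_per_cm2(current_a, time_s, diameter_nm):
    """Fluence D = I T / (pi d^2 / 4) for a uniform probe of diameter d."""
    area_cm2 = math.pi / 4.0 * (diameter_nm * 1e-7) ** 2
    return current_a * time_s / area_cm2


def low_loss_electrons(current_a, time_s, t_nm, lambda_e_nm):
    """Eq. (5.21): I(beta, Delta) ~ (I/e) T exp(-t / lambda_e)."""
    return current_a / E_CHARGE * time_s * math.exp(-t_nm / lambda_e_nm)


if __name__ == "__main__":
    # Illustrative inputs, not taken from a specific measurement protocol.
    d = dose_c_per_cm2(650e-12, 35e-3, 0.5)
    n = low_loss_electrons(650e-12, 35e-3, 30.0, 200.0)
    print(f"D = {d:.2e} C/cm^2, I(beta, Delta) = {n:.2e} electrons")
```

For these inputs the dose is slightly above $10^4$ C/cm$^2$, which shows how quickly a sub-nanometer probe accumulates fluence even for millisecond acquisition times.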

An equation analogous to Eq. (5.20) can be written for the background intensity 
beneath the edge (as shown in Fig. 4.11): 


$$ I_b \approx N_t \, I(\beta, \Delta) \, \sigma_b(\beta, \Delta) \qquad (5.22) $$ 


where $\sigma_b(\beta, \Delta)$ is a cross section for all energy-loss processes that contribute to 
the background. The core-loss signal/noise ratio for an ideal spectrometer is given 
by Eq. (4.61), but using Eq. (2.42) to make allowance for the detective quantum 
efficiency (DQE) of the detector, the measured signal/noise ratio is 


$$ \mathrm{SNR} = (\mathrm{DQE})^{1/2} I_k/(I_k + h I_b)^{1/2} \approx (\mathrm{DQE})^{1/2} I_k (h I_b)^{-1/2} \qquad (5.23) $$ 


where $h$ is the factor representing statistical error associated with background sub- 
traction, typically in the range 5-10 for an extrapolated background (Fig. 4.13). 
Combining Eqs. (5.20), (5.22), and (5.23), the atomic fraction of an analyzed 
element is 


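$$ f = \frac{N}{N_t} = \frac{\mathrm{SNR}}{\sigma_k(\beta, \Delta)} \left[ \frac{h\, \sigma_b(\beta, \Delta)}{(\mathrm{DQE})\, I(\beta, \Delta)\, N_t} \right]^{1/2} \qquad (5.24) $$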


Taking SNR = 3 (98% certainty of detection if Gaussian statistics apply) and using 
Eq. (5.21), the minimum detectable atomic fraction is 


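$$ \mathrm{MAF} = f_{\min} \approx \frac{3}{\sigma_k(\beta, \Delta)} \left( \frac{1.1}{d} \right) \left[ \frac{h\, \sigma_b(\beta, \Delta)}{(\mathrm{DQE})(D/e)\, N_t} \right]^{1/2} \exp\left( \frac{t}{2\lambda_e} \right) \qquad (5.25) $$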


For a given radiation dose D, and assuming the specimen is sufficiently uniform, a 
large beam diameter d favors the detection of low atomic concentrations. This is so 
because the radiation damage is spread over a larger volume of material, permitting 
a larger beam current or acquisition time. A thermionic emission source, capable of 
providing a large beam current, is then preferable. On the other hand, the minimum 
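detectable number of atoms, MDN = $(\pi/4) d^2 f_{\min} N_t$, is given by 


$$ \mathrm{MDN} = \frac{\pi}{4} d^2 f_{\min} N_t \approx \frac{2.7\, d}{\sigma_k(\beta, \Delta)} \left[ \frac{h\, \sigma_b(\beta, \Delta)\, N_t}{(\mathrm{DQE})(D/e)} \right]^{1/2} \exp\left( \frac{t}{2\lambda_e} \right) \qquad (5.26) $$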


For a given radiation dose D, a small probe diameter d is required to detect the 
minimum number of atoms, favoring the use of a field emission source. 
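
The opposite dependence of MAF and MDN on probe diameter can be seen by evaluating the expressions directly. The Python sketch below combines Eq. (5.24) with Eq. (5.21) for SNR = 3; the function names are hypothetical, and the example inputs (cross sections of order $10^{-21}$ cm$^2$, $N_t = 2.7 \times 10^{17}$ cm$^{-2}$, $h$ = 9, DQE = 0.5, $t$ = 30 nm, $\lambda_e$ = 200 nm) are assumed values chosen to resemble a calcium-in-carbon case rather than measured data.

```python
import math

E_CHARGE = 1.602176634e-19  # electronic charge (C)


def maf(sigma_k, sigma_b, n_t, dose, d_cm, t_nm, lambda_e_nm,
        snr=3.0, h=9.0, dqe=0.5):
    """Minimum atomic fraction from Eqs. (5.21) and (5.24).

    Cross sections in cm^2, n_t in atoms/cm^2, dose in C/cm^2, d_cm in cm.
    """
    # Number of low-loss electrons I(beta, Delta), Eq. (5.21)
    electrons = (math.pi / 4.0 * d_cm**2 * (dose / E_CHARGE)
                 * math.exp(-t_nm / lambda_e_nm))
    return snr / sigma_k * math.sqrt(h * sigma_b / (dqe * electrons * n_t))


def mdn(sigma_k, sigma_b, n_t, dose, d_cm, t_nm, lambda_e_nm, **kwargs):
    """Minimum detectable number of atoms, MDN = (pi/4) d^2 f_min N_t."""
    f_min = maf(sigma_k, sigma_b, n_t, dose, d_cm, t_nm, lambda_e_nm, **kwargs)
    return math.pi / 4.0 * d_cm**2 * f_min * n_t


if __name__ == "__main__":
    common = dict(sigma_k=9.9e-21, sigma_b=1.9e-21, n_t=2.7e17,
                  dose=1.0e4, t_nm=30.0, lambda_e_nm=200.0)
    for d_nm in (0.5, 1.0, 10.0, 100.0):
        d_cm = d_nm * 1e-7
        print(f"d = {d_nm:6.1f} nm  MAF = {maf(d_cm=d_cm, **common):.2e}  "
              f"MDN = {mdn(d_cm=d_cm, **common):.1f}")
```

At fixed dose the computed MAF scales as $1/d$ while MDN scales as $d$, which is the trade-off between thermionic and field-emission sources described in the text. The rounded coefficients 1.1 and 2.7 in Eqs. (5.25) and (5.26) correspond to $(4/\pi)^{1/2}$ and $3(\pi/4)^{1/2}$ in this exact form.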

As seen from Eqs. (5.25) and (5.26), MAF and MDN will be lowest for an ele- 
ment with a high core-loss cross section $\sigma_k(\beta, \Delta)$; in other words, a low-energy 
edge or an edge sharply peaked at the threshold. However, edges below 100-eV 
loss present problems because $\sigma_b(\beta, \Delta)$ is large (due to valence-electron excita- 
tion) and the background is further increased by plural scattering. As discussed in 
Section 4.4.3, SNR and therefore MDN and $f_{\min}$ depend on the position and width 
of the background fitting and integration windows. 

Ignoring the exponential term and possible variation of DQE with $E_0$, Eqs. (5.25) 
and (5.26) predict that (MAF)$^2$ and (MDN)$^2$ are proportional to $\sigma_b \sigma_d/\sigma_k^2$, where 
$\sigma_d = e/D$ is a damage cross section. If ionization damage (due to inelastic scatter- 
ing) prevails, these cross sections are all proportional to $v^{-2}$ (Section 5.7.5), so both 
MAF and MDN should be independent of the incident electron energy. A computer 
program is available for modeling energy-loss spectra, including instrumental and 
shot noise, and is useful for predicting whether a particular ionization edge will be 
visible for a given specimen composition and thickness and particular TEM operat- 
ing conditions (Menon and Krivanek, 2002). Since electron dose to the specimen is 
often the factor limiting resolution, dose-efficient recording is important; method- 
ology and acquisition scripts have been developed to optimize the process (Sader 
et al., 2010; Mitchell and Schaffer, 2005). 

Figure 5.32 shows the detection limits for calcium (L-loss signal) within a 
30-nm-thick carbon matrix, calculated for 100-keV incident electrons and parallel- 
recording EELS. Single calcium atoms should be detectable with a sub-nanometer 
probe, but would involve a high radiation dose; even in the absence of radiolytic 
processes, D ~ 10° C/cm$^2$ can remove six layers of carbon atoms by sputtering 
(Leapman and Andrews, 1992). For larger probe sizes (or larger scanned areas in 
STEM), the detection of Ca/C ratios down to a few parts per million is predicted, in 
agreement with measured error limits of about 0.75 mmol/kg or 9 ppm (Shuman and 
Somlyo, 1987; Leapman et al., 1993b). Leapman and Rizzo (1999) have pointed out 
that although electrons may destroy the structure of biological molecules, this does 
not necessarily prevent an accurate measurement of elements such as Ca and Fe that 
were originally present. 

Although calcium represents a favorable case, the detection limits for phos- 
phorus are comparable. From energy-selected CTEM images, Bazett-Jones and 
Ottensmeyer (1981) reported a phosphorus signal/noise ratio of 29 from a nucle- 
osome containing 140 base pairs of DNA (280 phosphorus atoms), equivalent to the 
detection of 29 atoms at SNR = 3. Using a STEM and parallel recording spectrom- 
eter, Krivanek et al. (1991a, b) measured the O45 signal from clusters of thorium 
atoms on a thin carbon film; quantification revealed that the signal originated from 
just a few atoms. 

Suenaga et al. (2000) were the first to report images of single atoms whose 
atomic number could be identified by EELS; see Fig. 5.33a. Gadolinium (Gd) 
atoms were placed within C82 fullerene molecules that were in turn encapsu- 
lated within single-wall carbon nanotubes (forming so-called peapods). The 100-kV 
STEM probe produced a radiation dose approaching $10^4$ C/cm$^2$ but the atoms were 
confined sterically and secondary electrons (which cause most of the damage in 
fullerenes and organic compounds) would be free to escape. Even so, an assess- 
ment of the number of atoms (from a background-subtracted Gd N-edge and using 


Fig. 5.32 Minimum number of calcium atoms (dotted lines) and minimum atomic fraction of 
calcium (dashed lines) detectable in a 30-nm carbon matrix ($N_t = 2.7 \times 10^{17}$ cm$^{-2}$), calculated 
from Eqs. (5.26) and (5.25) for different doses of 100-keV electrons. The calculations assume 
DQE = 0.5, $h$ = 9, $\lambda_e$ = 200 nm, $\beta$ = 10 mrad, and $\Delta$ = 50 eV and hydrogenic cross sections 
$\sigma_L(\beta, \Delta) = 9.9 \times 10^{-21}$ cm$^2$ and $\sigma_b(\beta, \Delta) = 1.9 \times 10^{-21}$ cm$^2$. The circular data point is from 
Leapman et al. (1993b) and the square point from Shuman and Somlyo (1987), both experimental 
data. For scanned probe analysis, the effective probe diameter is $(4A/\pi)^{1/2}$, $A$ being the scanned 
area 


a Hartree-Slater cross section of 8700 barn) yielded numbers between 0 and 4, sug- 
gesting that some Gd atoms migrated along the nanotube and clustered together 
under the intense irradiation. Subsequent measurements at 60 keV (Suenaga et al., 
2009) showed no observable movement of Er atoms, although Ca atom images 
had 300 times less intensity than expected from the L-shell cross section (4700 
barn, $\Delta$ = 10 eV), suggesting movement out of the beam during the electron 
exposure. 

Using an aberration-corrected STEM, Varela et al. (2004) identified single La 
atoms in crystalline CaTiO3; see Fig. 5.33c. The signal/noise ratio was said to be 
sufficient to determine the electronic properties (e.g., valency) of a single atom. The 
authors point out that because of channeling, the intensity of different elemental 
signals varies differently with depth, which could allow the depth of an atom to be 
determined by comparison with computer simulations of probe spreading. 

Riegler and Kothleitner (2010) have given a revised version of Eq. (5.25) for the 
case where the atomic fraction is measured using multiple least-squares fitting to 
spectral standards, rather than background extrapolation and integration: 


Fig. 5.33 (a) Single Gd atoms (red) inside a single-wall carbon nanotube (blue), recorded with a 
650-pA 0.5-nm STEM probe and (b) energy-loss spectrum acquired in 35 ms from the central pixel 
of a Gd atom (Suenaga et al., 2000). (c) 


For the case of Cr in Al2O3 (ruby), they calculate a minimum atomic fraction of 
0.04% for a thermionic source TEM (0.3 nA beam current and 40 s recording 
time) and 0.12% for a Schottky source TEM with monochromator (0.1 nA and 60 s 
recording time). 

It should also be noted that multivariate statistical analysis (MSA) can isolate 
and remove noise components from spectrum image data (Borglund et al., 2005) 
after visual identification of an “elbow” on the scree plot; see Section 4.4.4. It 
seems likely that noise arising from background and matrix ($N_t$) components can 
be eliminated, whereas that associated with characteristic signal cannot. If so, Eqs. 
(5.25) and (5.26) will overestimate MDN and MAF when MSA is employed and the 
analyzed element is present in small concentrations. 


5.5.4.1 Comparison with EDX Spectroscopy 


The characteristic x-ray signal (number of detected photons) recorded by an EDX 
detector in response to a probe current $I$ within acquisition time $T$ is 


$$ I_x = N (I/e)\, T \, \omega_k \, \sigma_k(\pi, E_0) \, \eta_x \qquad (5.28) $$


where $\omega_k$ is the fluorescence yield and $\sigma_k(\pi, E_0)$ is the total ionization cross section 
for shell $k$; $\eta_x$ is the collection efficiency of the x-ray detector, including photon 
absorption in the front end, the detector window, and the specimen. Using Eqs. 
(5.20) and (5.28), we can compare the EELS signal $I_k$ and the x-ray signal $I_x$ 
acquired under the same conditions: 


$$ \frac{I_k}{I_x} = \frac{\sigma_k(\beta, \Delta)}{\eta_x \, \omega_k \, \sigma_k(\pi, E_0)} \exp\left( \frac{-t}{\lambda_e} \right) \qquad (5.29) $$


The exponential term (typically 0.3) represents loss of EELS signal as a result of 
elastic scattering outside a typical collection aperture. Core-loss intensity is also 
reduced by the aperture and because only that fraction lying within an energy range 
A of the ionization threshold is utilized. As a result, the cross-sectional ratio in 
Eq. (5.29) is appreciably less than unity; 0.1 might be a typical value. However, 
the x-ray fluorescence yield, which is close to 1 for K-lines of heavy elements, is 
below 0.05 for photon energies below 2000 eV ($\omega_k \approx 0.002$ for carbon-K x-rays). In 
addition, $\eta_x$ is below 0.1 even for an x-ray detector with a high solid angle (~1 sr) 
and considerably less for low-energy photons because of absorption in the specimen 
and at the detector front end. As a result, the EELS signal is typically less than the 
EDX signal for heavy elements but larger by a factor of several hundred for a light 
element such as carbon. 
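
The orders of magnitude quoted above follow directly from Eq. (5.29). The Python sketch below evaluates the ratio for two illustrative cases; the function name is hypothetical, and the collection efficiency of 0.05 used for low-energy photons is an assumed value within the range described in the text.

```python
import math


def eels_to_edx_ratio(sigma_ratio, eta_x, omega_k, t_over_lambda_e):
    """Eq. (5.29): I_k / I_x for signals acquired under the same conditions.

    sigma_ratio is sigma_k(beta, Delta) / sigma_k(pi, E0).
    """
    return sigma_ratio / (eta_x * omega_k) * math.exp(-t_over_lambda_e)


if __name__ == "__main__":
    t_ratio = -math.log(0.3)  # thickness giving an exponential term of 0.3
    light = eels_to_edx_ratio(0.1, eta_x=0.05, omega_k=0.002,
                              t_over_lambda_e=t_ratio)
    heavy = eels_to_edx_ratio(0.1, eta_x=0.1, omega_k=1.0,
                              t_over_lambda_e=t_ratio)
    print(f"light element (carbon-like): I_k/I_x ~ {light:.0f}")
    print(f"heavy element K-line:        I_k/I_x ~ {heavy:.2f}")
```

With these inputs the ratio is about 300 for the carbon-like case and about 0.3 for a heavy-element K-line, in line with the qualitative statement above.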

A major advantage of EDX spectroscopy is that the background to the char- 
acteristic peaks is relatively low. Moreover, this background can be subtracted by 
interpolation rather than extrapolation, equivalent to h = 2 in Eq. (4.61), so that the 
signal/noise ratio is 


$$ (\mathrm{SNR})_x = I_x/(I_x + 2 I_b)^{1/2} \qquad (5.30) $$ 


For low elemental concentrations, $I_b$ cannot be neglected relative to $I_x$ and a model 
for the background intensity $I_b$ is required in order to calculate detection limits (Joy 
and Maher, 1977). As an alternative, Leapman and Hunt (1991) obtained the SNR 
and (SNR)$_x$ from $\chi^2$ values produced by MLS fitting to energy-loss and x-ray spec- 
tra recorded simultaneously from test specimens (10-nm carbon films containing 
small concentrations of F, Na, P, Cl, Ca, and Fe). The SNR/(SNR)$_x$ ratio is plotted 
in Fig. 5.34 and suggests that EELS offers higher sensitivity for low-Z elements and 
medium-Z elements with L23 edges in the 30-700 eV range. Watanabe et al. (2003) 
point out that this comparison was performed using very thin specimens and that the 
relative sensitivity of the two techniques depends very much on specimen thickness. 
EELS may be more sensitive for thicknesses (typically 30-50 nm) that provide good 


Fig. 5.34 Elemental sensitivity of EELS relative to EDX spectroscopy, assuming a parallel- 
recording electron spectrometer and an x-ray detector with solid angle 0.18 sr (left-hand axis) 
or 0.9 sr (right-hand axis). Based on second-difference MLS processing of both spectra and sig- 
nal/background ratios measured using 100-keV incident electrons (Leapman and Hunt, 1991). The 
increase in sensitivity between Cl and Ca is due to the emergence of white-line peaks at the L23 edge 


signal/noise ratio at an ionization edge, whereas EDXS provides better sensitivity 
(at least for Cu-Mn alloys) in thicker specimens. 

As explained in Section 5.6.1, the x-ray signal from an ultrathin specimen may be 
somewhat more localized than the EELS signal recorded close to the corresponding 
ionization edge. But in a typical specimen, the spatial resolution of the x-ray signal 
is degraded by beam spreading, whose effect in EELS can be controlled by using a 
spectrometer entrance aperture; see Section 5.5.2. 


5.6 Structural Information from EELS 


As discussed in Chapters 3 and 4, inelastic scattering in a solid is sensitive to the 
crystallographic and electronic structure of a specimen, as well as its elemental com- 
position. As a result, structural information can be obtained from the dependence 
of the inelastic intensity on specimen orientation, from the angular dependence of 
scattering, or from fine structure present in the low-loss or core-loss regions of the 
energy-loss spectrum. 


5.6.1 Orientation Dependence of Ionization Edges 


In a crystalline specimen, the transmitted electron wavefunction can be written as a 
sum of Bloch waves, the probability of inner-shell excitation being proportional to 
the square of the modulus of this sum (Cherns et al., 1973). The total intensity dis- 
tribution has the periodicity of the lattice, with nodes and antinodes whose position 
within each unit cell is a sensitive function of crystal orientation (Section 3.1.4). At 
high energy loss, where electron scattering from inner shells is localized near the 
center of each atom (Section 5.5.3), the core-loss intensity will therefore change as 
the specimen is tilted about an axis perpendicular to the incident beam. In x-ray 
studies, this is known as the Borrmann effect. While it represents a source of error 
in EELS elemental analysis of crystalline materials, it can be used constructively to 
determine the crystallographic site of a particular element, selected by means of its 
ionization energy. 

The orientation of a specimen relative to the incident beam determines the chan- 
neling condition, which affects the rate of inner-shell ionization and the rate of x-ray 
production, as discussed in Section 3.1.4. The ALCHEMI method of atomic site 
determination is based on planar channeling (Spence and Taftø, 1983); the orienta- 
tion dependence of x-ray emission is measured for two elements that lie on alternate 
planes containing the incident beam direction. The crystallographic site of a third 
element is then determined by comparing its x-ray orientation dependence with that 
of the other two. An alternative axial channeling method involves measuring charac- 
teristic x-ray signals from a matrix element and an impurity (atomic site unknown) 
with the incident beam traveling first along a low-index zone axis and then in a ran- 
dom orientation; the ratio of the fractional changes in signal gives the fraction F 
of impurity atoms that lie on particular atomic columns of the matrix (Pennycook, 
1988). To reduce the influence of experimental errors, Rossouw et al. (1989) applied 
multivariate statistical procedures to the analysis of spectra recorded under several 
zone axis diffraction conditions. 

In the EELS case, primary electrons that have caused inner-shell excitation may 
travel further within the specimen before being detected. In doing so, they are 
again subject to channeling, which affects their probability of escape in a particular 
direction (an effect known as blocking). If this direction is defined by a collection 
aperture centered about the incident beam direction, the blocking effect of the crys- 
tal on the inelastically scattered electrons augments the channeling of the incident 
beam (equivalent to double alignment in particle-channeling experiments), and the 
orientation dependence observed by EELS can be larger than that seen in x-ray 
emission spectroscopy. If R is the factor by which an elemental ratio (ratio of the 
core-loss signals due to two different elements present at nonequivalent crystal- 
lographic sites) changes with orientation, one might expect (Taftø and Krivanek, 
1982a) 


$$ R_{\mathrm{EELS}} = (R_x)^2 \qquad (5.31) $$


However, factors related to the localization of inelastic scattering reduce $R_{\mathrm{EELS}}$ rel- 
ative to $(R_x)^2$. Characteristic x-rays can be produced by any energy loss above the 
ionization threshold $E_k$ and if the core-loss intensity is proportional to $E^{-s}$, the mean 
energy loss is 


$$ \langle E \rangle = \int E \, E^{-s} \, dE \Big/ \int E^{-s} \, dE \approx E_k (s - 1)/(s - 2) \qquad (5.32) $$


For scattering through all angles, $s$ is close to 3 for most energy losses (Fig. 3.40), 
giving $\langle E \rangle \approx 2E_k$, whereas the core-loss signal is usually measured at energy losses 
just above the edge threshold, corresponding to an average loss close to $E_k$. As seen 
from Fig. 5.30, this lower mean loss implies less localized scattering and a reduced 
Borrmann effect (Bourdillon et al., 1981). 
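
The step leading to Eq. (5.32) can be written out explicitly. The worked integral below is a sketch that assumes the power law $E^{-s}$ holds from the threshold $E_k$ up to arbitrarily high energy loss and that $s > 2$, so that both integrals converge; the integration limits are an assumption made here for the illustration.

```latex
\langle E \rangle
  = \frac{\int_{E_k}^{\infty} E \, E^{-s} \, dE}{\int_{E_k}^{\infty} E^{-s} \, dE}
  = \frac{E_k^{\,2-s}/(s-2)}{E_k^{\,1-s}/(s-1)}
  = E_k \, \frac{s-1}{s-2},
  \qquad s > 2 .
% For s = 3 this gives <E> = 2 E_k, the value quoted in the text.
```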

In fact, delocalization of the inelastic scattering will reduce both $R_{\mathrm{EELS}}$ and $R_x$ in 
the case of light elements (such as oxygen) with low threshold energies. Pennycook 
(1988) observed that the fractional change in x-ray signal between axial and random 
orientations is reduced for photon energies below 5000 eV (by a factor of 0.6 at 
1300 eV). Calculations of Spence et al. (1988) showed the reduction to be less for 
planar channeling; Qian et al. (1992) observed orientation dependence of the oxygen 
K-signal by both EELS and EDX spectroscopy but they emphasize that a correction 
factor for delocalization is necessary for quantitative analysis. Rossouw et al. (1989) 
have argued that their statistical procedure makes approximate allowance for the 
degree of localization. 

Fortunately, the localization and orientation dependence in EELS can be 
increased (at the expense of reduced signal) by collecting electrons deflected 
through larger angles. For planar channeling (incident beam far from a major zone 
axis), the chosen scattering angle can be made arbitrarily large (without affecting the 
blocking or channeling conditions) by displacing the collection aperture in a direc- 
tion parallel to the appropriate Kikuchi band (Taftø and Krivanek, 1981). Beam 
deflector coils can translate the diffraction pattern in an appropriate direction rel- 
ative to the spectrometer entrance aperture, making use of the observed Kikuchi 
bands (Taftø and Krivanek, 1982a). 

An example of these orientation effects is shown in Fig. 5.35. Spectra (a) and (b) 
were recorded with the collection aperture centered around the zero-order diffrac- 
tion spot; the ratio of the Mg and Al K-edge intensities varies by only a factor of 
1.8 when the specimen orientation is changed so that the Kikuchi line at the edge of 
the (400) band crosses the collection aperture. In cases (c) and (d), the diffraction 
pattern has been displaced parallel to the (400) band in order to increase the local- 
ization of the inelastic scattering entering the aperture; as a result, the Al/Mg ratio 
changes by a factor of 9 as the specimen is tilted through the (400) Bragg condition. 
If the illumination and detector apertures are placed on opposite sides of a Kikuchi 
line, the channeling and blocking effects largely cancel (Taftø and Krivanek, 1981). 

To determine the atomic site of a particular element from planar channeling, 
the specimen orientation must be carefully chosen. In the case of minerals with 
the spinel structure (general formula AB2O4), the incident beam should be nearly 
parallel to (800) planes but away from a principal zone axis. The standing-wave 
field is then determined mainly by those (800) planes that contain all of the oxygen 
atoms and two-thirds of the metal atoms on octahedral sites (Taftø et al., 1982). The 
remaining metal atoms occur in tetrahedral coordination on the intervening (800) 
planes and are strongly excited if the incident beam direction lies just outside the 


Fig. 5.35 Core-loss spectra measured from a normal spinel (MgAl2O4) with the incident beam 
nearly parallel to (100) planes. Insets show the location of the incident beam (solid circle) and 
collection aperture (open circle) relative to the (400) and (800) Kikuchi bands. In (a) and (b), the 
collection aperture is centered about the illumination axis; the Mg/Al intensity ratio increases by 
a factor of 1.8 as the crystal is tilted so that the aperture lies outside rather than inside the (400) 
band. In (c) and (d), the collection aperture has been displaced by 10 mrad parallel to the (400) 
band to increase the localization of core-loss scattering; the Mg/Al ratio then increases by a factor 
of 9 between the two crystal orientations. From Taftø and Krivanek (1982a), copyright Elsevier 


(400) Kikuchi band (Fig. 5.35b). Conversely, the octahedral (and oxygen) atoms are 
strongly excited if the incident beam lies just inside the (400) band (Fig. 5.35a). In 
a normal spinel, A atoms are on tetrahedral sites and B atoms on octahedral sites; 
in an inverse spinel, the tetrahedral sites are filled by half of the B atoms, octahedral 
sites accommodating the A and remaining B atoms. Provided components A and B 
give rise to clearly observable ionization edges, the orientation dependence of the 
edge can be used to determine whether a given spinel has the normal or inverse 
structure, even if both A and B contain a mixture of two different elements (Taftø 
et al., 1982). 

In mixed-valency compounds, one of the elements (such as a transition metal) is 
present as differently charged ions whose ionization edges may be distinguishable 
as a result of a chemical shift (Section 3.7.4). For example, Fe$^{2+}$ and Fe$^{3+}$ ions in 
chromite spinel give rise to sharp white-line threshold peaks shifted by about 2 eV. 
Taftø and Krivanek (1982b) utilized this chemical shift, together with the orientation 
dependence of the ionization edges, to show that their sample of chromite spinel 
contained all of the Fe$^{2+}$ atoms on tetrahedral sites and all of the Fe$^{3+}$ atoms on 
octahedral sites. 

Channeling experiments require specimens containing single-crystal regions as 
large as the incident beam diameter, usually several nanometers, because the con- 
vergence of smaller probes reduces the orientation effect. The specimen thickness 
should be at least equal to the extinction distance $\xi_g$, in order to provide a pronounced 
variation in current density within each unit cell. Otherwise the orientation depen- 
dence will be weak and the measurement prone to statistical error. The extinction 
distance not only is proportional to the incident electron velocity, but also depends 
on crystal orientation and atomic number (Hirsch et al., 1977; Reimer and Kohl, 
2008). For 100-keV electrons and a strong channeling direction, $\xi_g \approx 50$ nm for 
carbon, decreasing to 20 nm for gold. 


5.6.2 Core-Loss Diffraction Patterns 


In the previous section, we discussed the variation of core-loss intensity as the speci- 
men is tilted, keeping the collection aperture fixed. We now consider the variation in 
intensity with scattering angle for a fixed sample orientation. An amorphous speci- 
men has an axially symmetric scattering distribution; at low energy loss the intensity 
is peaked about the unscattered direction, while for energy losses far above an ion- 
ization threshold it takes the form of a diffuse ring, representing a section through 
the Bethe ridge (Fig. 3.31). In the case of a crystalline specimen, elastic scattering 
results in a diffraction pattern containing Bragg spots (or rings, for a polycrystal) 
and Kikuchi lines or bands. Energy-filtered diffraction patterns can be recorded in a 
scanning transmission electron microscope by rocking the incident beam in angle or 
by using post-specimen deflection coils to scan the pattern across a small-aperture 
detector, but a more efficient procedure is to use a stationary incident beam and an 
imaging filter (Section 2.6). 

At low energy loss, the diffraction pattern resembles the zero-loss pattern, but 
with the diffraction spots broadened by the angular width of inelastic scattering. 
This regime corresponds to a median angle of inelastic scattering ⟨θ⟩ less than the
angular separation between Bragg beams, approximately the lowest-order Bragg
reflection angle θ_B. Taking the delocalization length as L ≈ 0.6λ/⟨θ⟩ (Section
5.5.3) and using the Bragg equation λ = 2dθ_B, this condition is equivalent to L > d
or (since the interplanar spacing d is comparable to the lattice constant) localization 
of the inelastic scattering exceeding the unit-cell dimensions. Under these condi- 
tions, inelastic scattering does not greatly change the angular distribution of elastic 
scattering and diffraction contrast is preserved in energy-selected images of defects 
such as stacking faults and dislocations (Craven et al., 1978). 
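
The low-loss condition can be checked numerically. The sketch below assumes the small-angle Bragg relation λ = 2dθ_B and the estimate L ≈ 0.6λ/⟨θ⟩ used above; the spacing d and the median angles passed in are illustrative inputs, not measurements from the text, and the function names are hypothetical.

```python
import math

H = 6.62607015e-34      # Planck constant, J s
M0 = 9.1093837015e-31   # electron rest mass, kg
E = 1.602176634e-19     # elementary charge, C
C = 2.99792458e8        # speed of light, m/s

def wavelength_nm(e0_kev):
    """Relativistic electron wavelength in nm."""
    ev = e0_kev * 1e3 * E                                  # kinetic energy in joules
    p = math.sqrt(2 * M0 * ev * (1 + ev / (2 * M0 * C**2)))  # momentum
    return H / p * 1e9

def bragg_spots_preserved(e0_kev, d_nm, theta_med_mrad):
    """Return (flag, theta_B in mrad, L in nm); flag is True if <theta> < theta_B."""
    lam = wavelength_nm(e0_kev)
    theta_b = lam / (2 * d_nm)                   # small-angle Bragg angle, rad
    deloc = 0.6 * lam / (theta_med_mrad * 1e-3)  # delocalization length, nm
    return theta_med_mrad * 1e-3 < theta_b, theta_b * 1e3, deloc

print(bragg_spots_preserved(100.0, 0.3, 1.0))    # low loss: small <theta>
print(bragg_spots_preserved(100.0, 0.3, 20.0))   # high loss: large <theta>
```

For d = 0.3 nm at 100 keV the Bragg angle is about 6 mrad, so a median angle of 1 mrad keeps L well above d, whereas 20 mrad does not.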

At higher energy loss, the inelastic scattering becomes highly localized and the
average inelastic scattering angle exceeds the angular separation of the diffracted
beams. Bragg spots therefore disappear from the energy-filtered diffraction pattern,
which starts to resemble the Kossel pattern from an isotropic source of electrons
inside the crystal or the Kikuchi pattern obtained from a thick specimen; see
Fig. 5.36c. If the specimen is sufficiently thin, a diffuse ring representing the Bethe
ridge is visible at high energy loss, as in Fig. 5.36e.

Fig. 5.36 Energy-filtered diffraction patterns of a thin silicon crystal with the incident beam close
to a ⟨111⟩ axis. (a) Zero-loss pattern, overexposed to show Kikuchi lines resulting from phonon
scattering. (b)-(e) Inelastic patterns recorded at multiples of 100 eV, showing the broadening and
diminution of diffraction spots, the development of Kikuchi bands, and the emergence of a diffuse
ring representing the Bethe ridge

By subtracting diffraction patterns recorded just above and below an ioniza- 
tion edge of a chosen element, a core-loss diffraction pattern can be produced. 
Spence (1980, 1981) carried out dynamical calculations to determine the condi- 
tions under which this pattern has the symmetry of the local coordination of the 
selected element, rather than the symmetry of the whole crystal. He found that both 
the localization of the core-loss scattering and the inelastic spread 2t⟨θ⟩ within the
specimen thickness t must be less than unit-cell dimensions. For E_0 = 100 keV
and an inorganic crystal with small unit cell (0.6 nm), the first condition requires
E_k > 200 eV; the second implies t < 50 nm for E_k ≈ 200 eV or t < 15 nm for
E_k ≈ 1000 eV. The requirements are relaxed for larger unit cells. Core-loss diffraction
could therefore provide an alternative to channeling and ELNES techniques for
determining the atomic site of light atoms in a crystal. 
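
The second condition, an inelastic spread 2t⟨θ⟩ smaller than the unit-cell dimension a, corresponds to a maximum thickness t_max = a/(2⟨θ⟩). The sketch below evaluates it; the median angles of 6 and 20 mrad are assumptions chosen to be consistent with the thickness limits quoted above, not values stated in the text.

```python
def max_thickness_nm(cell_nm, theta_med_mrad):
    """Largest thickness t for which 2*t*<theta> stays below the cell size."""
    return cell_nm / (2.0 * theta_med_mrad * 1e-3)

# (edge energy in eV, assumed median scattering angle in mrad)
for e_k, theta in [(200, 6.0), (1000, 20.0)]:
    print(e_k, "eV edge: t_max ~", round(max_thickness_nm(0.6, theta)), "nm")
```

Doubling the unit-cell dimension doubles t_max, which is the sense in which the requirements are relaxed for larger unit cells.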

From measurements on LaAlO3, Midgley et al. (1995) concluded that exami- 
nation of HOLZ intensities in a core-loss CBED pattern can be used to determine 
which atomic species (or which sublattice in a complex crystal) contributes most to a
particular Bloch state, information that might contribute to the solution of unknown
crystal structures. Botton (2005) has demonstrated asymmetry in K-loss diffraction
patterns of graphite recorded at the π* and σ* energies with a tilted specimen
and has proposed energy-filtered diffraction as a means of studying the bonding in 
anisotropic materials. 


5.6.3 ELNES Fingerprinting 


In compounds containing coordinate bonding, such as minerals and organic com- 
plexes, it is sometimes desirable to know the coordination number and the sym- 
metry of the nearest-neighbor ligands surrounding a metal ion. As discussed in 
Section 3.8.1, the energy-loss near-edge structure (ELNES) of an ionization edge 
represents approximately a local density of states at the atom giving rise to the
edge. This interpretation is consistent with multiple scattering (XANES) calcula- 
tions of the backscattering of the ejected electron (Section 3.8.4), which show that 
the scattering is quite localized, involving just a few near-neighbor shells (Wang 
et al., 2008a). 

Sometimes the scattering of the ejected core electron is mostly from a single shell 
of atoms, where these are strongly scattering species such as the O²⁻ or F⁻ ions. In
effect, the ions form a cage or potential barrier that impedes the escape of the inner- 
shell electron (Bianconi et al., 1982). The near-edge fine structure then serves as a 
coordination fingerprint when applied to mineral specimens (Taftø and Zhu, 1982).
In the case of polymers and macromolecules, the XANES structure can provide a 
fingerprint of the functional groups that act as building blocks for the entire structure 
(Stöhr and Outka, 1987). The fact that the scattering is localized allows molecular
orbital calculations to be used as a basis for interpreting and labeling the peaks 
(Sauer et al., 1993). 

ELNES where the excited atom is in trigonal planar coordination (three nearest 
neighbors lying in a plane) is shown in Fig. 5.37. The carbon K-edge of a carbonate 
group is characterized by a narrow π* peak followed (at 10-11 eV separation) by
a broader σ* peak, quite different from the fine structure of carbon in any of its
elemental forms. The same kind of K-edge fine structure is observed for prismatic 
(nonplanar) coordination (Brydson, 1991) and for trigonally coordinated boron in 
the mineral vonsenite (Rowley et al., 1990). 

Situations involving tetrahedral coordination are shown in Fig. 5.38. The Si 
L-edge of the SiO4 tetrahedron shows two sharp peaks (separated by ~7 eV) fol- 
lowed by a third prominent broad peak separated about 22 eV from the first peak. 
When observed at 0.5-eV energy resolution, the first peak is seen to contain a smaller 
peak about 1.9 eV lower in energy. A similar peak structure was observed for the 
chlorate, sulfate, and phosphate anions (Hofer and Golub, 1987; Brydson, 1991) 
with some differences of detail. 

Examples of octahedral coordination are shown in Fig. 5.39. A sharp peak (a) is
followed by two broader peaks (b) and (c), displaced by about 7 and 19 eV. Fairly
similar structures are observed at the K-edges of MgO, where both Mg and O atoms
are octahedrally coordinated (Colliex et al., 1985; Lindner et al., 1986).

Fig. 5.37 Carbon K-edges of minerals containing the carbonate anion, an example
of trigonal planar bonding, compared with the K-edges of elemental carbon. Spectra
(energy resolution 0.5 eV) were deconvolved to remove plural scattering and peak
distortion due to the asymmetrical energy distribution of the field-emission source.
Curves are labeled graphite, diamond, amorphous carbon, hydrotalcite, desautelsite,
and siderite; the horizontal axis is energy loss, 280-310 eV. From Garvie et al. (1994),
copyright Mineralogical Society of America, with permission

Several factors complicate this simple concept of coordination fingerprints. If the 
symmetry of the coordination is distorted, near-edge peaks are broadened or split 
into components (Buffat and Tuilier, 1987; Brydson et al., 1992b). For example, the 
Si L23 edge of zircon shows five peaks in place of the first two that characterize
SiO4 tetrahedra (McComb et al., 1992). Atoms outside the first coordination shell
may contribute, which probably explains the two additional peaks seen in the calcite
carbon K ELNES (Fig. 5.37). In fact, Jiang et al. (2008) found that the most significant
differences between the ELNES of MgO and Mg(OH)2 arose from atoms in
the second and third coordination shells. Core hole effects and crystal-field splitting
cause the L3 and L2 edges of transition metals to appear as two components when
observed at high energy resolution (Krivanek and Paterson, 1990; Krishnan, 1990); 
modeling of these effects could yield information relating to crystal field strength 
(Garvie et al., 1994; Stavitski and de Groot, 2010). 

Fig. 5.38 Silicon L23 edges of three silicates containing SiO4 tetrahedra, compared
with the L23 edges of SiC and Si. Spectra (energy resolution 0.5 eV) were deconvolved to
remove plural scattering and peak distortion due to the asymmetrical energy
distribution of the field-emission source. From Garvie et al. (1994),
copyright Mineralogical Society of America, with permission

In addition, several factors can cause apparent differences in structure measured
from different specimens or in different laboratories. If a thermionic electron source
(energy width typically 1-2 eV) is used, the energy resolution may be insufficient to
resolve peak splittings such as those visible in Fig. 5.38. If recorded with electrons
from a cold field-emission source, peak shapes may be distorted because of the
asymmetry of the emission profile, as seen in the zero-loss peak. If the specimen
thickness exceeds a few hundred nanometers, the overall shape of the ionization
edge is modified by plural scattering (Section 3.7.3), increasing the heights of peaks
occurring 15 eV or more from the threshold. This distortion can be removed by
Fourier deconvolution, which can also correct for the asymmetry of the emission
profile; see Appendix B.
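
As an illustration of how plural scattering can be removed, the following is a minimal sketch of a Fourier-ratio deconvolution, in which the transform of the background-subtracted edge is divided by that of the low-loss spectrum and reconvolved with the zero-loss profile to limit noise amplification. It is a simplified stand-in for the procedures of Appendix B, not a transcription of them; the synthetic spectra, channel numbers, and function names are all assumptions made for the demonstration.

```python
import numpy as np

def fourier_ratio(core_loss, low_loss, zero_loss):
    """Fourier-ratio deconvolution of a background-subtracted core-loss edge.

    All arrays share one dispersion and length; the low-loss and zero-loss
    arrays have the zero-loss peak at channel 0 (wrapping around the ends).
    Returns the single-scattering edge broadened by the zero-loss profile.
    """
    ratio = np.fft.rfft(core_loss) / np.fft.rfft(low_loss)
    return np.fft.irfft(np.fft.rfft(zero_loss) * ratio, n=len(core_loss))

# --- synthetic check (all numbers are illustrative) ---
n = 1024
ch = np.arange(n)

def peak(center, sigma, area):
    """Gaussian on a periodic channel axis, normalized to the given area."""
    d = (ch - center + n // 2) % n - n // 2
    g = np.exp(-0.5 * (d / sigma) ** 2)
    return area * g / g.sum()

def circ_conv(a, b):
    """Circular convolution via FFT."""
    return np.fft.irfft(np.fft.rfft(a) * np.fft.rfft(b), n=n)

zero_loss = peak(0, 1.5, 1.0)                  # unit-area zero-loss peak
low_loss = zero_loss + peak(20, 4.0, 0.5)      # plus a single plasmon peak
single = np.where(ch >= 300, np.exp(-(ch - 300) / 150.0), 0.0)  # ideal edge
core_loss = circ_conv(single, low_loss)        # edge with plural scattering
recovered = fourier_ratio(core_loss, low_loss, zero_loss)
```

With noise-free synthetic data the recovered profile equals the ideal edge convolved with the zero-loss peak; real spectra additionally need careful background subtraction and matching of the low-loss and core-loss intensities.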

A small collection angle (<15 mrad at 100 keV) simplifies ELNES interpreta- 
tion by ensuring that nondipole transitions are excluded, although these transitions 
might be used creatively to explore the wavefunction symmetry (Auerhammer and 
Rez, 1989). A single crystal is not necessary; in fact, the fine structure may be 
more reproducible if recorded from a polycrystalline area containing several grains, 
suppressing orientation effects (Brydson et al., 1992a, b), or under magic-angle 
conditions (see Appendix A). In principle, crystal anisotropy provides additional 
information about the directionality of bonding but requires careful control over the 
specimen orientation. 

Because the initial-state wavefunction is more localized, higher energy edges
may offer more characteristic fingerprints. This consideration favors K-edges rather
than the low-energy N23 or O23 edges, whose plasmon-like shape is practically independent
of the atomic environment (Colliex et al., 1985). But the 1s states for Z >
14 and 2p states for Z > 28 have natural widths that exceed 0.5 eV (Fig. 3.51a), so
K-edges above 2000 eV and L23 edges above 1000 eV can be expected to show less
fine structure.

Fig. 5.39 Aluminum L23 edge in (A) chrysoberyl and (B) rhodizite, together with
(C) ICXANES calculations for aluminum octahedrally coordinated by a single shell
of oxygen atoms. From Brydson et al. (1989), reproduced with permission
from The Royal Society of Chemistry, http://dx.doi.org/10.1039/C39890001010

ELNES fingerprinting has been applied to several materials science problems. By 
comparing the Al K-edges of ion-thinned specimens of blast-furnace slag cement 
with those recorded from minerals (orthoclase, pyrope, hydrotalcite) and with mul- 
tiple scattering calculations, Brydson et al. (1993) concluded that two phases were 
present: one with Al atoms substituted for Si at tetrahedral sites and the other 
(Mg-rich) phase containing Al at octahedral sites. Bruley et al. (1994) recorded 
Al L23 spectra from a diffusion-bonded niobium/sapphire interface. Agreement was
obtained with multiple scattering calculations by assuming an interfacial monolayer
of aluminum atoms tetrahedrally bonded to three oxygen and one Nb atom, with
Al-Nb bonds providing the cohesive energy. Interfaces produced by molecular beam
epitaxy showed no modification of the Al-L ELNES at the interface, suggesting that
the sapphire was terminated by oxygen and that charge transfer from Nb provided 
the cohesive force. Similar spatially resolved ELNES studies were used to establish 
the symmetry and coordination number of Al atoms at a 35.2° grain boundary in 
sapphire (Bruley, 1993).

Fig. 5.40 (a) Boron-K ELNES of a B-doped Fe/Cr oxide layer formed on
stainless steel in superheated steam. (b) Composite model spectrum formed by adding
boron K-edges recorded from colemanite (BO3 fingerprint) and rhodizite (containing
BO4) in the ratio 7:3. From Sauer et al. (1993), copyright Elsevier

Another example of the use of ELNES is the work of Rowley et al. (1991) on the
oxidation of Fe/Cr alloys in superheated steam. The presence of boron increases the
oxidation resistance by creating a diffusion barrier, a thin microcrystalline film of
composition (M_xB_(1-x))2O3, where M = Fe, Cr, or Mn. The boron K-edge (Fig. 5.40)
exhibits a sharp peak at 194 eV and a broad peak containing two components centered
on 199 and 203 eV, believed to represent BO3 and BO4 borate groupings
from comparison with the K-edges of vonsenite and rhodizite. The proportions of
these two components varied as the electron probe was moved across the sample,
suggesting that the film consisted of separate phases of MBO3 and M3BO6 (Sauer
et al., 1993). In the case of heavy boron doping, a small prepeak preceding the 194-eV
maximum was taken to indicate the presence of a boride, whose π* peak was
shifted down in energy due to the lower effective charge on the boron atom.
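
A composite spectrum of the kind shown in Fig. 5.40b amounts to a weighted sum of reference edges, so the proportions of the two borate groupings can be estimated by a linear least-squares fit. The sketch below uses synthetic Gaussian stand-ins for the reference spectra (they are not real BO3 or BO4 data) and a hypothetical helper name; it only illustrates the fitting step.

```python
import numpy as np

def fit_reference_mix(measured, references):
    """Least-squares weights w such that sum_i w[i]*references[i] ~ measured."""
    design = np.column_stack(references)
    weights, *_ = np.linalg.lstsq(design, measured, rcond=None)
    return weights

# Synthetic stand-ins for two reference edges (illustrative shapes only).
e = np.linspace(185.0, 215.0, 301)                       # energy loss, eV
ref_a = (np.exp(-0.5 * ((e - 194.0) / 0.6) ** 2)
         + 0.4 * np.exp(-0.5 * ((e - 203.0) / 3.0) ** 2))
ref_b = np.exp(-0.5 * ((e - 199.0) / 2.5) ** 2)

measured = 0.7 * ref_a + 0.3 * ref_b                     # 7:3 mixture
weights = fit_reference_mix(measured, [ref_a, ref_b])
fractions = weights / weights.sum()
print(fractions)
```

In practice the reference and measured edges would first be background-subtracted, deconvolved, and normalized to a common intensity scale, and a non-negativity constraint on the weights may be needed for noisy data.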

Using thin films of silicon alloys, Auchterlonie et al. (1989) showed that Si atom 
nearest neighbors (B, C, N, O, or P) can be distinguished using the fine structure 
of the Si L23 edge, as well as from the energy (chemical shift) of the first ELNES
peak. The bonding type in various forms of carbon and carbon alloys can be measured
from the π*/σ* ratio at the carbon K-edge, as discussed in Section 5.7.3. As
discussed in Chapter 3, ELNES fine structure can also serve as a guide to the den- 
sities of unoccupied states in a solid and can therefore be used as a check on band 
structure calculations. 

Bianconi et al. (1983a) predicted that the energy of the broad shape resonance 
peak (due to transitions to σ* states) is proportional to 1/R², where R is the
nearest-neighbor distance and the proportionality constant depends on the type of nearest
neighbors. This simple rule could be useful (as in XAS) for measuring changes in 
bond length. 
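
The 1/R² rule can be turned into a simple estimate of a bond-length change. The sketch below assumes that the resonance energy is measured from a fixed reference energy (taken here to be the edge threshold, which is an assumption rather than something stated above) and that the type of nearest neighbor, and hence the proportionality constant, is unchanged; the numbers are illustrative.

```python
import math

def bond_length_from_shift(r_ref_nm, de_ref_ev, de_new_ev):
    """Bond length from the rule dE * R**2 = constant.

    de_ref_ev and de_new_ev are shape-resonance energies measured from the
    same reference energy, for a known (r_ref_nm) and an unknown bond length.
    """
    return r_ref_nm * math.sqrt(de_ref_ev / de_new_ev)

# Illustrative numbers: a resonance moving from 20 eV to 21 eV above threshold.
r_new = bond_length_from_shift(0.160, 20.0, 21.0)
print(f"estimated bond length: {r_new:.4f} nm")
```

A 5% increase in resonance energy therefore corresponds to a contraction of roughly 2.4% in the nearest-neighbor distance.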

There has recently been much interest in using ELNES to investigate the
structure of interfaces, particularly within materials used in the electronics industry.
MacKenzie et al. (2006) prepared cross-sectional TEM samples containing a
Si/SiO2/HfO2/TiN/poly-Si gate stack and obtained evidence of interfacial reaction
between the TiN gate and the surrounding layers, together with a silicon oxyni- 
tride phase at the TiN/Si interface. Other microelectronics examples are given in 
Section 5.7.1. A review of ELNES as a tool for investigating electronic structure on 
a nanometer scale is given by Keast et al. (2001). 


5.6.4 Valency and Magnetic Measurements from White-Line Ratios


As discussed in Section 3.7.1, the L2 and L3 edges of transition metals are characterized
by white-line peaks at the ionization threshold. The energy separation of
these peaks reflects the spin-orbit splitting of the initial states of the transition; their
prominence (relative to the higher-energy continuum and to each other) varies with atomic
number Z, as seen in Fig. 3.45. 

A decrease in white-line intensity with increasing Z is understandable, since 
the d-states fill up and the density of empty states just above the Fermi level 
decreases. One might expect I_w ∝ (10 − n_d), where I_w is the sum of the two
white-line intensities (relative to the continuum) and n_d is the d-state occupancy.
Measurements on metallic films (Pearson et al., 1993) are in approximate agreement 
with this; see Fig. 5.41c, d. Figure 5.41a, b indicates one procedure for obtaining the 
white-line/continuum ratio: the continuum is measured as an integral over a 50-eV 
window, starting 50 eV beyond the edge, and linearly extrapolated to an energy 
loss corresponding to the center of each white line so that the white-line intensity 
(shaded area) can be measured. 
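
The procedure just described can be sketched in a few lines. The version below is a simplified single-line illustration, not the exact method of Fig. 5.41: it assumes a background-subtracted edge, fits a straight line to a 50-eV continuum window, extrapolates that line back as a step starting at the white-line center, and takes the remaining area as the white-line intensity. The window half-width, the synthetic spectrum, and the function names are all assumptions.

```python
import numpy as np

def integrate(x, y):
    """Trapezoidal integral of y(x)."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def white_line_ratio(energy, intensity, line_center, line_halfwidth,
                     cont_start, cont_width=50.0):
    """White-line area divided by continuum area for one edge."""
    cont = (energy >= cont_start) & (energy <= cont_start + cont_width)
    slope, intercept = np.polyfit(energy[cont], intensity[cont], 1)
    # Extrapolated continuum: a step that switches on at the white-line center.
    step = np.where(energy >= line_center, slope * energy + intercept, 0.0)
    line = np.abs(energy - line_center) <= line_halfwidth
    excess = intensity - step
    return (integrate(energy[line], excess[line])
            / integrate(energy[cont], intensity[cont]))

# Synthetic edge (illustrative): unit continuum step at 708 eV plus a
# Gaussian white line of area 10 and standard deviation 1 eV.
energy = 680.0 + 0.25 * np.arange(561)
center = 708.0
spectrum = (np.where(energy >= center, 1.0, 0.0)
            + 10.0 / np.sqrt(2 * np.pi) * np.exp(-0.5 * (energy - center) ** 2))
ratio = white_line_ratio(energy, spectrum, center, 8.0, cont_start=center + 50.0)
print(ratio)
```

For this synthetic edge the white-line area is 10 and the continuum integral is 50, so the ratio comes out close to 0.2.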

The variation of I_w with n_d has been used to measure charge transfer in disordered
and amorphous copper alloys (Pearson et al., 1994). Corrections were made for 
changes in matrix element and the Cu white-line intensity was measured relative to 
a normalized L23 edge of metallic copper, which has no white lines. Each copper 
atom was found to lose 0.2 ± 0.06 electrons when alloyed with Ti or Zr, between
0 and 0.06 electrons when alloyed with Au or Pt, and between 0 and 0.09 electrons 
when alloyed with Pt. Electron transfer back to copper was observed after the alloys 
became crystalline. 

If the matrix element and final densities of states were the same for all of the 
excited p-electrons, the intensity ratio R_w = I(L3)/I(L2) of the two white-line peaks
should reflect the degeneracy ratio of the initial-state (2p3/2 and 2p1/2) electrons,
namely 4/2 = 2; see Appendix D. But as a result of spin-spin coupling, R_w depends
on the number of electrons in the final (3d) state and therefore varies with atomic 
number and oxidation state (Thole and van der Laan, 1988; de Groot et al., 1991). 
This behavior is illustrated for Co and Mn compounds in Fig. 5.42 and provides an 
alternative means of measuring valency or oxidation state.

Fig. 5.41 Procedure used by Okamoto et al. (1992) to measure white-line/continuum ratio R_c of
(a) 4d transition metals and (b) 3d transition metals. Graphs show the variation of R_c with (c) 4d
and (d) 3d occupancy. Copyright, The Metals Society


Fig. 5.42 White-line intensity ratio R_w for (a) cobalt and (b) manganese
compounds, as a function of the cation valence. From Wang et al. (2000), copyright
Elsevier

The white-line intensities I(L2) and I(L3) can be determined by curve fitting or by
a procedure similar to that of Fig. 5.41. The errors involved when different methods
are applied to manganese compounds are discussed by Riedl et al. (2006). Wong
(1994) has shown (for the case of nickel and its silicides) that variations in R_w can
result from solid-state effects, besides variations in d-state occupancy.

A systematic study of a large number of chromium compounds, covering six
valences and two different spin states, was reported by Daulton and Little (2006).
Measurement of the L3/L2 ratio employed a method similar to Fig. 5.41, with the ratio
between the two continuum backgrounds initially set at 2 but then made equal to the
derived L3/L2 ratio, this iteration being repeated 10 times. Continuum lines having
zero slope gave less scatter in the results and were consequently adopted. High-spin
Cr(II) compounds were easily distinguished by their high L3/L2 ratio (1.9-2.4),
other groups being hard to distinguish on the basis of L3/L2 ratio alone.
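
The self-consistent choice of continuum ratio can be pictured as a fixed-point iteration. The sketch below is a toy model and not the published procedure: it assumes zero-slope continuum steps that switch on at each white-line center, integration windows of equal half-width around each line, and a known total continuum level beyond the L2 line. All names and numbers are hypothetical.

```python
def l3_l2_ratio(area_l3, area_l2, cont_level, half_width, n_iter=10):
    """Iterate the L3/L2 white-line ratio with a two-step continuum.

    area_l3, area_l2: integrated intensity in windows of +/- half_width (eV)
    centered on each white line, after pre-edge background removal.
    cont_level: continuum height beyond the L2 line (zero slope assumed).
    """
    ratio = 2.0                                   # initial continuum step ratio
    for _ in range(n_iter):
        step_l3 = cont_level * ratio / (1.0 + ratio)   # L3 continuum step height
        step_l2 = cont_level - step_l3                 # L2 continuum step height
        # The L3 step fills the upper half of the L3 window.
        i_l3 = area_l3 - step_l3 * half_width
        # The L2 window holds the L3 step throughout, plus half of the L2 step.
        i_l2 = area_l2 - step_l3 * 2.0 * half_width - step_l2 * half_width
        ratio = i_l3 / i_l2                       # feed the result back in
    return ratio

# Toy input built from white lines of area 30 and 20 (true ratio 1.5),
# unit continuum level and 5-eV half-width windows.
print(l3_l2_ratio(area_l3=33.0, area_l2=28.0, cont_level=1.0, half_width=5.0))
```

In this toy case the estimate settles on 1.5 within a few passes, which suggests why a fixed number of iterations such as 10 is sufficient.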

White-line ratios at the iron L23 edge were used to study the conditions for ferromagnetism
in amorphous alloys (Morrison et al., 1985). R_c remains approximately
the same as germanium is added to iron, showing that the d band occupancy is unaltered
and that the gradual loss of ferromagnetism cannot be explained in terms of
charge transfer in or out of the 3d band. However, R_w does change, indicating redistribution
of electrons between the d5/2 and d3/2 sub-bands with a change in spin
pairing, which may account for the change in magnetic moment. Measurements on
a crystalline Cr20Au80 alloy showed that the L3/L2 white-line ratio increased by a
factor of 1.6 compared to pure chromium, indicating a substantial shift in spin den- 
sity between j = 5/2 and j = 3/2 states, which may be the reason for a sevenfold 
increase in the magnetic moment. 

Koshino et al. (2000) recorded L23 edges from vapor-deposited films of phthalocyanine
bonded to various transition metals. They found I_w ∝ 3d-state vacancy;
see Fig. 5.43. Comparison of their measured branching ratio I(L3)/[I(L2) + I(L3)]
with values calculated by Thole and van der Laan indicated a high spin state for the 
compounds FePc, MnPc, and NiPc. 
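
The branching ratio and the white-line ratio R_w carry the same information, and the statistical values follow from the 2j + 1 degeneracies of the initial states. A minimal sketch, with hypothetical function names:

```python
def statistical_ratio(j_upper=1.5, j_lower=0.5):
    """L3/L2 ratio expected from initial-state degeneracies (2j + 1) alone."""
    return (2 * j_upper + 1) / (2 * j_lower + 1)

def branching_ratio(i_l3, i_l2):
    """Branching ratio I(L3) / [I(L2) + I(L3)]."""
    return i_l3 / (i_l2 + i_l3)

def branching_from_white_line_ratio(r_w):
    """Convert R_w = I(L3)/I(L2) into the branching ratio."""
    return r_w / (1.0 + r_w)

r_stat = statistical_ratio()                       # 4/2 = 2 for 2p3/2 and 2p1/2
print(r_stat, branching_from_white_line_ratio(r_stat))
```

The statistical white-line ratio of 2 corresponds to a branching ratio of 2/3; measured departures from these values are what reflect the occupancy and spin state of the final 3d states.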

The L3/L2 ratio can be displayed as an intensity map. This was done using EFTEM
imaging by Wang et al. (1999) in order to display variations in the valence state of 
Mn and Co in mixed-valence specimens. 
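
Forming such a map is a pixel-by-pixel division of two white-line images. The sketch below assumes that the two images have already been background-subtracted and aligned, and masks pixels where the L2 signal is too weak for a meaningful ratio; the threshold and function name are illustrative assumptions, not details of the cited work.

```python
import numpy as np

def ratio_map(img_l3, img_l2, min_counts=1.0):
    """Pixel-by-pixel L3/L2 ratio; pixels with weak L2 signal are set to NaN."""
    img_l3 = np.asarray(img_l3, dtype=float)
    img_l2 = np.asarray(img_l2, dtype=float)
    out = np.full(img_l3.shape, np.nan)
    ok = img_l2 >= min_counts          # avoid dividing by noise or zero
    out[ok] = img_l3[ok] / img_l2[ok]
    return out

l3 = [[40.0, 30.0], [5.0, 0.0]]
l2 = [[20.0, 10.0], [0.0, 0.0]]
print(ratio_map(l3, l2))
```

Regions of different valence then appear as different gray levels, provided specimen drift between the two acquisitions has been corrected.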


5.6.4.1 Spin-State Measurements 


Small changes (a few percent) in the L3/L2 ratio of transition metals have been used
to determine electron spin state. For this purpose the L-edge has been measured
at selected points in the diffraction pattern, in a crystalline specimen of chosen
thickness. Differences occur because of interference between the inelastically
scattered waves and can be interpreted in terms of the mixed dynamic form factor
(Schattschneider et al., 2000). Termed energy-loss magnetic chiral dichroism
(EMCD) by analogy with x-ray magnetic circular dichroism (XMCD), this TEM
technique offers the promise of higher spatial resolution. Applied to biomineralized
magnetite crystals in magnetotactic bacteria, a spatial resolution of 2 nm has been
demonstrated (Stöger-Pollach et al., 2011). Zhang et al. (2009) used EMCD to confirm
the ferromagnetic nature of ZnO nanoparticles doped with transition metal
atoms. Future aims include the study of magnetic properties at interfaces, of vital
importance for spintronic and magnetic storage devices (Schattschneider, 2011).

Verbeeck et al. (2010) have advocated the use of a holographic mask, taking the 
form of a computer-generated aperture (made by FIB milling of a Pt foil), to produce
a vortex beam in the TEM with a spiral wavefront and orbital angular momentum. 
The idea was tested with a ferromagnetic iron sample; see Fig. 5.44. Compared to 
the previous angle-resolved method of measuring chirality, this technique promises 
improved signal/noise ratio and greater convenience, as the specimen thickness and 
orientation do not need to be specially chosen.

Fig. 5.44 (a) Scheme for dichroic measurement using a computer-generated mask (b) placed a distance
z beyond focus of a specimen image, together with a spectrometer entrance aperture located
(c) at the left or right sideband of the far-field diffraction pattern. (d) L3 and L2 edges recorded
from a 50-nm-thick iron specimen for the two positions of the entrance aperture, together with
the difference signal that indicates asymmetry of the Δm = ±1 dipole transitions (Verbeeck et al.,
2010). Copyright 2010, Nature Publishing Group. See also Schattschneider et al. (2008)


5.6.5 Use of Chemical Shifts


The threshold energy of an ionization edge, or changes in threshold between differ- 
ent atomic environments (chemical shift), can provide information about the charge 
state and atomic bonding in a solid. In the past, EELS chemical shift measure- 
ments have been of limited accuracy compared to those carried out by photoelectron 
spectroscopy, but the situation has improved with the development of highly sta- 
ble high-voltage and spectrometer power supplies and dual-recording detectors 
(Gubbens et al., 2010). As discussed in Chapter 3, the EELS chemical shift represents
a net effect, involving both the initial and final states of a core-electron
transition. Coordination number also has an influence, accommodated in the concept 
of coordination charge (Brydson et al., 1992b). 

Muller (1999) has argued that for metals the core-loss shift arises mainly from 
changes in valence band width arising from changes in atomic bonding, rather than 
charge transfer. EELS could therefore provide information about the occupied states 
in a metal. While the spatial difference method (Section 4.4.5) can detect core-level 
shifts as small as 50 meV, these shifts could be misinterpreted as indicating a change 
in the density of states at an interface (Muller, 1999). 

A simple example of chemical shift is the change in energy of the π* peak from
284 eV in graphite to 288 eV in calcite (Fig. 5.37) as a result of highly electroneg- 
ative O atoms surrounding each C atom. Martin et al. (1989) found that the carbon 
K-edge recorded from calcium alkylaryl sulfonate micelles, which contain a cal- 
cium carbonate core surrounded by hydrocarbon molecules, can be represented as 
a superposition of the K-edges of calcite and graphite. Peaks in the carbon K-edge 
fine structure of nucleic acid bases were similarly interpreted by Isaacson (1972b) 
and Johnson (1972) in terms of chemical shifts of the π* peak, arising from the
different environments of carbon atoms within each molecule. They reported a peak 
shift proportional to the effective charge at each site (Kunzl’s law). 

For silicon alloys, Auchterlonie et al. (1989) showed that the energy of the first 
peak at the Si L-edge was displaced by an amount proportional to the electronega- 
tivity of the nearest-neighbor atoms (B, P, C, N, and O); see Fig. 5.45. On the basis 
of this shift and the near-edge structure, their amorphous alloys could be uniquely 
identified. Brydson et al. (1992a, b) explained the shape of the oxygen K-edges of 
the minerals rhodizite, wollastonite, and titanite in terms of the potential at each of 
the oxygen sites. 

To simultaneously measure the spectra across gate-dielectric multilayers, Kimoto 
et al. (1997, 1999) used spatially resolved EELS, with a slit placed in front of the 
electron spectrometer. The SREELS technique ensures that high-voltage fluctua- 
tions do not introduce systematic errors, as can happen when spatial resolution is 
achieved by scanning a small probe. 

Fig. 5.45 Energy shift (eV) of the first peak at the silicon L23 edge, plotted against
Pauling electronegativity (relative to Si) of ligand atoms. From
Auchterlonie et al. (1989), copyright Elsevier

Daulton and Little (2006) measured the chromium L3 threshold energy of many
Cr compounds, calibrating their energy-loss spectrometer to 855.0 eV for the Ni-L3
edge of NiO. Their results were plotted against L3/L2 ratio and showed a clear correlation
but considerable scatter, suggesting that other factors (coordination, low- or
high-spin configuration, spin-orbit interactions, Coulomb repulsion, and exchange
effects) influence the L23 edge structure. While it appeared easier to distinguish
between the common oxidation states, Cr(III) and Cr(VI), on the basis of chemical
shift rather than L3/L2 ratio, the authors argue that both measurements are necessary
for the unambiguous determination of valency in transition metal compounds.
Daulton et al. (2003) made similar L3/L2 and chemical shift measurements on anaerobic
Shewanella oneidensis bacteria, using an environmental cell to keep specimens
hydrated in the TEM. The data fell within the Cr(II) region (see Fig. 5.46), confirming
these bacteria as active sites for the reduction of toxic Cr(VI) species in
chromium-contaminated water.


Fig. 5.46 [...] ratio plotted against L3 [...] vacuum (square data points) [...]
(central data point with large error bars). Each circular data
point represents the mean and standard deviation of
measurements on a particular compound. Reproduced from
Daulton et al. (2003), with permission from Cambridge [...]


5.6.6 Use of Extended Fine Structure


As discussed in Section 4.6, a radial distribution function (RDF) specifying interatomic
distances relative to a particular element can be derived by Fourier analysis
of the extended fine structure (EXELFS), starting about 50 eV beyond an ionization
edge threshold. This procedure has been tested on various model systems and,
after correction for phase shifts, has yielded first and second nearest-neighbor distances
that agree with x-ray measurements to accuracies between 0.01 and 0.001 nm
(Johnson et al., 1981b; Kambe et al., 1981; Leapman et al., 1981; Stephens and
Brown, 1981; Qian et al., 1995).
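
The Fourier step can be illustrated with a synthetic single-shell oscillation. The sketch below is not the full analysis of Section 4.6: it omits background fitting, phase-shift correction, and amplitude terms, uses a Hann window and k-weighting as assumed choices, and works on a uniform grid in wavenumber k (in nm⁻¹), obtained from the energy above threshold via k ≈ 5.123·sqrt(E/eV).

```python
import numpy as np

def k_from_energy(e_above_edge_ev):
    """Ejected-electron wavenumber (1/nm) for an energy above threshold in eV."""
    return 5.123 * np.sqrt(e_above_edge_ev)

def rdf_magnitude(k, chi, r, weight_power=1):
    """|Fourier transform| of windowed k**n * chi(k), evaluated at radii r (nm)."""
    window = np.hanning(len(k))               # suppresses truncation ripples
    integrand = chi * k**weight_power * window
    dk = k[1] - k[0]                          # assumes a uniform k grid
    kernel = np.exp(2j * np.outer(r, k))      # exp(2ikr) for each (r, k) pair
    return np.abs(kernel @ integrand) * dk

# Synthetic single-shell oscillation (illustrative): neighbors at 0.19 nm,
# no phase shift, data starting 50 eV beyond the threshold.
r_true = 0.19
k = np.linspace(k_from_energy(50.0), 120.0, 901)
chi = np.sin(2 * k * r_true) / k
r = np.arange(0.05, 0.40, 0.0005)
rdf = rdf_magnitude(k, chi, r)
r_peak = r[np.argmax(rdf)]
print(f"RDF peak at {r_peak:.3f} nm")
```

With real data the peak is displaced from the true distance by the scattering phase shifts, which is why the correction mentioned above is needed before comparing with x-ray values.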

Higher precision is possible when measuring changes in interatomic distance
in specimens of similar chemical composition. The RDF of sapphire (α-Al2O3)
and amorphous (anodized) alumina in Fig. 4.22 shows a change in Al-O distance
of 0.003 nm. No measurable shift in the nearest-neighbor peak occurred after
crystallizing the amorphous layer in the electron beam, consistent with the crystallized
material being γ- rather than α-alumina (Bourdillon et al., 1984). Aluminum
K-edge EXELFS was used to investigate the structure of ion-implanted α-Al2O3
using cross-sectional TEM specimens (Sklad et al., 1992). Implantation at −185°C
with 160-keV Fe ions (4 × 10^16 cm^-2) produced a 160-nm amorphous layer that
recrystallized epitaxially to α-Al2O3 upon annealing in argon at 960°C. Oxygen-edge
EXELFS required a restricted energy range because of the presence of an iron
L3 edge at 708 eV. Implantation with a stoichiometric mixture of Al and O ions
produced an amorphous layer that recrystallized into a mixture of γ-Al2O3 and
epitaxial α-Al2O3, as determined from Al K-edge EXELFS; see Fig. 5.47.

Fig. 5.47 Nearest-neighbor peak in the RDF obtained from Al K-edge EXELFS of (a) α-Al2O3
implanted with 160-keV Fe ions (4 × 10^16 cm^-2), compared with the crystalline substrate and (b)
a layer implanted with a stoichiometric mixture of Al and O ions, compared with γ-Al2O3 made
by annealing for 1 h in argon. Data have been corrected for phase shifts using empirical correction
factors. From Sklad et al. (1992), copyright Taylor and Francis

Batson and Craven (1979) used a field-emission STEM to record K-edge
EXELFS from amorphous carbon films, revealing differences in RDF depending 
on whether the substrate was mica or KCl. These results demonstrate that, even 
with serial recording, EXELFS was able to provide structural information from very 
small areas, below 10 nm in diameter. As always, the fundamental limit to spatial 
resolution is radiation damage, but the problem can be acute in EXELFS studies 
because of the need to achieve excellent signal/noise ratio in the tail of an inner-shell 
edge. 

Noise statistics are improved in the case of lower energy edges; the Si L-edge 
from SiC was found to be sensitive to the atomic environment up to the sixth 
coordination shell (Martin and Mansot, 1991). However, low-energy EXELFS is 
difficult to analyze quantitatively, partly because of the high pre-edge background 
and restricted energy range imposed by other ionization edges. At the opposite 
extreme, EXELFS of the titanium K-edge (4966 eV), measured using 400-keV elec- 
trons and parallel recording (Blanche et al., 1993), yielded results comparable with 
synchrotron radiation EXAFS. Kaloyeros et al. (1988) used boron edge EXELFS to 
investigate the high-temperature stability of amorphous films of titanium diboride, 
made by electron-beam evaporation onto liquid-nitrogen-cooled substrates. 

As an example of a biological application, Fig. 5.48 shows RDFs recorded from 
iron-rich clusters (siderosomes) extracted from lung fluids of a patient suffering 
from silicosis (Diociaiuti et al., 1995). The Fe-L23 RDF is similar to that recorded 
from a hematite standard but the RDF from the oxygen K-edge exhibits a displaced 
Fig. 5.48 Radial distribution functions for (a) Fe atoms and (c) O atoms, obtained from EXELFS 
of alveolar macrophages (siderosomes). The iron and oxygen RDF recorded from hematite are 
given in (b) and (d), where vertical bars marked X, Z, and W show the expected Fe—O, Fe—Fe, and 
O-O interatomic distances. From Diociaiuti et al. (1995), copyright EDP Sciences (Les Editions 
de Physique) 
nearest-neighbor peak and a second-nearest-neighbor (O–O) peak of reduced inten- 
sity compared to hematite. These discrepancies were explained by assuming that 
oxygen is also present in a protein coat surrounding the biomineral core, forming 
short O=C bonds that shift the center of the first peak to lower radius. Diociaiuti 
and colleagues (1991, 1992a, b) recorded EXELFS from chromium, copper, and 
palladium clusters in order to quantify the increase in nearest-neighbor distance (up 
to 5%) with decreasing particle size and the effect of oxidation. 

EXELFS measurements as a function of temperature enabled Okamoto et al. 
(1992) to study ordering in undercooled alloys, particularly chemical short-range 
order that is difficult to measure by techniques such as diffuse x-ray scattering. The 
temperature dependence of the mean-square relative displacement (MSRD) was 
used to deduce Einstein and Debye temperatures, good agreement being obtained 
with force-constant theory and with previous experimental data. Differences in 
Einstein temperature between ordered and disordered alloys, reflecting differences 
in vibrational states, are illustrated in Fig. 5.49. Since the EXELFS data can in prin- 
ciple be obtained from small specimen volumes, measurement of a local Debye 
temperature might usefully characterize the defect density at interfaces or in small 
precipitates (Disko et al., 1989). 
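
The Einstein-model curves mentioned above can be sketched numerically. The block below is a minimal illustration, assuming the standard single-frequency (Einstein) expression for the MSRD of an atom pair, σ² = ħ²/(2μ k_B Θ_E) · coth(Θ_E/2T), with μ the reduced mass of the pair. The function names, the 300 K Einstein temperature, and the Fe–Fe reduced mass are illustrative assumptions, not values taken from Okamoto et al. (1992).

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
K_B = 1.380649e-23       # Boltzmann constant, J/K
AMU = 1.66053906660e-27  # atomic mass unit, kg


def einstein_msrd(temp_k, theta_e_k, reduced_mass_amu):
    """MSRD (m^2) of an atom pair in the Einstein model (assumed form)."""
    mu = reduced_mass_amu * AMU
    prefactor = HBAR**2 / (2.0 * mu * K_B * theta_e_k)
    # coth(x) = 1 / tanh(x)
    return prefactor / math.tanh(theta_e_k / (2.0 * temp_k))


def relative_msrd(temp_k, theta_e_k, reduced_mass_amu, ref_temp_k=97.0):
    """MSRD relative to a reference temperature (97 K, as in Fig. 5.49)."""
    return (einstein_msrd(temp_k, theta_e_k, reduced_mass_amu)
            - einstein_msrd(ref_temp_k, theta_e_k, reduced_mass_amu))


if __name__ == "__main__":
    mu_fe_fe = 55.845 / 2.0  # reduced mass of an Fe-Fe pair, amu
    for t in (97.0, 200.0, 300.0):
        # 1 m^2 = 1e20 square angstroms
        print(t, relative_msrd(t, 300.0, mu_fe_fe) * 1e20)
```

Fitting such a curve to measured MSRD differences, with Θ_E as the only free parameter, is one way an Einstein temperature could be extracted; a lower Θ_E gives a steeper rise with temperature.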

To achieve the 0.1% statistical accuracy that is desirable for analyzing EXELFS 
data, as many as 10⁶ electrons need to be recorded within each resolution element 
(typically 2 eV). Careful monitoring of relative peak heights within each readout 
can alert the operator to damage and changes in specimen thickness within the beam 
(Qian et al., 1995). High-voltage and spectrometer drift can be corrected by shifting 
successive readouts back into register. It is important to carefully remove diode array 
dark current and gain variations (Section 2.5.5). 
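
Shifting successive readouts back into register can be illustrated with a simple cross-correlation. The sketch below is only one possible approach, assuming integer-channel drift and readouts of equal length; the function name `align_and_sum` is hypothetical, and `np.roll` wraps channels around the ends of the array, which is acceptable only when the edges carry negligible signal.

```python
import numpy as np


def align_and_sum(readouts):
    """Shift each readout into register with the first one, then sum.

    Drift is estimated as the integer channel shift that maximizes the
    cross-correlation with the first readout.
    """
    reference = np.asarray(readouts[0], dtype=float)
    n = reference.size
    total = reference.copy()
    for readout in readouts[1:]:
        spectrum = np.asarray(readout, dtype=float)
        xcorr = np.correlate(spectrum - spectrum.mean(),
                             reference - reference.mean(),
                             mode="full")
        # In "full" mode, index n - 1 corresponds to zero shift.
        shift = int(np.argmax(xcorr)) - (n - 1)
        # Undo the drift (wraps around at the array ends).
        total += np.roll(spectrum, -shift)
    return total


if __name__ == "__main__":
    channels = np.arange(200)
    first = np.exp(-0.5 * ((channels - 80) / 4.0) ** 2)
    drifted = np.exp(-0.5 * ((channels - 83) / 4.0) ** 2)
    summed = align_and_sum([first, drifted])
    print(int(np.argmax(summed)))
```

Sub-channel drift would need interpolation, and in practice the correlation is usually restricted to a sharp feature such as an edge onset.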

Even with parallel recording, EXELFS analysis requires an incident electron 
exposure of typically 10⁻⁷ C. For a 1-μm-diameter incident beam, this is equivalent 
Fig. 5.49 Temperature dependence of the nearest-neighbor mean-square relative displacement 
(MSRD) measured for (a) Al and (b) Fe atoms in pure elements and in chemically disordered 
and ordered Fe3Al. All values are relative to measurements made at 97 K; solid lines represent 
the Einstein model, with an Einstein temperature T_E as indicated. From Okamoto et al. (1992), 
copyright TMS Publications 
to 1 C/cm² dose, enough to destroy the structure of most organic specimens and even 
some inorganic ones (see Section 5.7.3). Because most damage-producing inelastic 
collisions involve energy losses outside the EXELFS region, electrons may produce 
more damage than the monochromatic x-rays used in EXAFS studies (Isaacson and 
Utlaut, 1978; Hitchcock et al., 2008). If there exist materials in which structural 
damage takes place only as a result of inner-shell scattering, this conclusion would 
not apply (Stern, 1982). Provided radiation damage is not a problem, the TEM is 
competitive with synchrotron sources in the sense that the EXELFS recording time 
is typically less than that of EXAFS for ionization edges below 3 keV (Isaacson and 
Utlaut, 1978; Stern, 1982). 

The core-loss intensity is improved by increasing the collection semi-angle β, 
but at the expense of a higher pre-edge background (Section 3.5). If β is too large, 
nondipole contributions complicate the EXELFS analysis, but such effects are usu- 
ally assumed to be small (Leapman et al., 1981; Disko, 1981); see Section 3.8.2. For 
100-keV electrons, a semi-angle in the range 10-20 mrad should allow dipole theory 
to be used, while transmitting typically half of the core-loss scattering. However, this 
large angle will average out the directional dependence of EXELFS (Section 3.9), 
so a much smaller value of β is necessary to study the directionality of bonding. 


5.6.7 Electron—Compton (ECOSS) Measurements 


Electron energy-loss spectra are most often recorded with a collection aperture cen- 
tered on the optic axis, around the unscattered beam. If this aperture is displaced or 
the incident beam tilted through a few degrees so that only large-angle scattering is 
collected, a new spectral feature emerges at high energy loss in the form of a broad 
peak; see Fig. 5.50. Known as an electron-Compton profile, this peak represents a 
cross section (at constant scattering vector q) through the Bethe ridge; see Figs. 3.31 
and 3.36. Its center corresponds to an energy loss E that satisfies Eq. (3.132), the 
scattering angle θr being determined by the collection aperture displacement or the 
tilt of the incident beam. For sin θr << 1 and E << E0, Eq. (3.132) is equiv- 
alent to the relation E = γ²Tθr² for Rutherford scattering from a free stationary 
electron. 
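
As a rough numerical check of the small-angle relation E = γ²Tθ², the sketch below evaluates the expected center of the Compton profile. It assumes that T denotes m0v²/2 (with v the incident-electron speed) and that γ is the usual relativistic factor; the function name is hypothetical and the full Eq. (3.132) is not reproduced here.

```python
M0C2_KEV = 510.998950  # electron rest energy, keV


def compton_peak_energy_kev(e0_kev, theta_rad):
    """Center of the Compton profile from E = gamma^2 * T * theta^2.

    Valid only for small angles and E << E0 (assumed approximation).
    """
    gamma = 1.0 + e0_kev / M0C2_KEV
    beta_sq = 1.0 - 1.0 / gamma**2      # (v/c)^2
    t_kev = 0.5 * M0C2_KEV * beta_sq    # T = m0 v^2 / 2
    return gamma**2 * t_kev * theta_rad**2


if __name__ == "__main__":
    for e0 in (60.0, 100.0):
        print(e0, compton_peak_energy_kev(e0, 0.100))
```

For 100-keV electrons scattered through 100 mrad this gives a profile centered near 1.1 keV, falling to roughly 0.6 keV at 60 keV incident energy.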

The fact that the atomic electrons are not free is indicated by the width of the 
Compton profile, which is a measure of the electron binding energy. Therefore, 
although the peak contains overlapping contributions from both outer- and inner- 
shell electrons, the latter give a much broader energy distribution and contribute 
mainly to the tails of the profile. Conversely, the central region represents mainly 
scattering from bonding (valence) electrons. 

The spread of the Compton peak can also be thought of as a Doppler broaden- 
ing due to the “orbital” velocity or momentum distribution of the atomic electrons, 
closely related to the electron wavefunctions. Quantitative interpretation (Williams 
et al., 1981) is rather similar to the Fourier method of EXELFS analysis. After sub- 
tracting the background contribution from the tails of lower energy processes, the 
Fig. 5.50 (a) Energy-loss spectrum of amorphous carbon, recorded using 120-keV electrons scat- 
tered through an angle of 100 mrad. Zero-loss and plasmon peaks are visible (because of elastic 
and multiple scattering), in addition to the carbon K-edge and Compton peak. The dashed curve 
shows a computed free-atom Compton profile. (b) Reciprocal form factor derived for amorphous 
carbon (solid line) and for two basal plane directions in graphite. From Williams and Bourdillon 
(1982), copyright IOP Publishing 


energy scale is converted to one of momentum (or wave number k) of the atomic 
electron. The Fourier transform of the resulting profile (Fig. 5.50b) is known as a 
reciprocal form factor B(r) and is the autocorrelation function of the ground-state 
atomic wavefunction, in a direction specified by the scattering vector q (i.e., by 
the azimuthal location of the detector aperture). If the specimen is an insulating or 
a semiconducting crystal, zero crossings in B(r) are expected to coincide with the 
lattice spacings. 

The optimum scattering angle for recording the electron-Compton profile from 
a light element sample (such as graphite) appears to be about 100 mrad for 
60-100 keV electrons, resulting in a profile whose center lies at about 1 keV loss. 
An energy resolution of 10 eV is sufficient, but should be combined with an angu- 
lar resolution of about 3 mrad (Williams et al., 1984). The signal/background ratio 
at the Compton peak is maximized by making the sample as thin as possible, 
indicating that the background arises from plural or multiple scattering, chiefly 
large-angle elastic (or phonon) scattering accompanied by one or more small- 
angle (low-loss) inelastic events. Because of the large width of the Compton peak, 
accurate subtraction of the background is not easy. To obtain a tolerably low back- 
ground, the specimen thickness should be less than about 30 nm in the case of 
100 keV incident energy and a low-Z element such as carbon. Even thinner sam- 
ples are appropriate for higher Z elements and would result in longer recording 
times. 

Because of the need for both angular and energy discrimination, the collec- 
tion efficiency of the Compton signal is of the order of 10⁻⁶, assuming parallel 
recording. Adequate statistics within the Compton peak involve recording at least 
10⁶ electrons, requiring an incident exposure ≈3 × 10⁻⁷ C, equivalent to a dose of 
0.4 C/cm² for a 10-μm incident beam. Radiation damage and specimen contamina- 
tion are therefore potential problems. Gain variations between diode array elements 
should be removed (Jonas and Schattschneider, 1993). 
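
The exposure and dose figures in this paragraph follow from simple arithmetic, sketched below. The function names are hypothetical, the beam is assumed to be a uniformly illuminated disk, and the inputs (10⁶ recorded electrons, 10⁻⁶ collection efficiency, 3 × 10⁻⁷ C over a 10-μm-diameter beam) are taken as order-of-magnitude values.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C


def exposure_for_counts(n_recorded, collection_efficiency):
    """Incident charge (C) needed to record n_recorded electrons."""
    return n_recorded / collection_efficiency * E_CHARGE


def dose_c_per_cm2(exposure_c, beam_diameter_um):
    """Dose (C/cm^2) for a uniform circular beam of the given diameter."""
    radius_cm = 0.5 * beam_diameter_um * 1e-4  # 1 um = 1e-4 cm
    return exposure_c / (math.pi * radius_cm**2)


if __name__ == "__main__":
    print(exposure_for_counts(1e6, 1e-6))  # order 1e-7 C
    print(dose_c_per_cm2(3e-7, 10.0))      # roughly 0.4 C/cm^2
```

Because the dose scales inversely with the square of the beam diameter, focusing the same exposure into a 1-μm beam would raise the dose a hundredfold.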

Electron-Compton measurements have shown that the bonding in arc-evaporated 
carbon films is predominantly graphitic (Williams et al., 1983) and revealed appar- 
ent inadequacies in band structure calculations for graphite (Vasudevan et al., 1984). 
By using single-crystal silicon specimens with [100] and [111] orientation and plac- 
ing the collection aperture at selected points in the diffraction pattern, anisotropies in 
the electron-momentum distribution have been measured and compared with γ-ray 
experiments and with theory (Jonas et al., 1992; Schattschneider and Exner, 1995). 


5.7 Application to Specific Materials 


In this final section, we review some applications of EELS in selected areas of chem- 
istry and materials science. Answers to the relevant questions have often required 
low-loss spectroscopy, core-loss spectroscopy, and fine structure analysis, in addi- 
tion to TEM imaging, electron or x-ray diffraction, and other analytical techniques. 
The topics chosen here represent a personal choice; there have been significant 
applications in other fields such as metallurgy (Okamoto et al., 1992; Craven et al., 
2008), advanced materials (Zaluzec, 1992), and catalyst studies (Wang et al., 1987; 
Bentley, 1992; Crozier et al., 2008; Zhao et al., 2010). 


5.7.1 Semiconductors and Electronic Devices 


As semiconductor devices have shrunk toward nanometer dimensions, TEM-EELS 
has become increasingly useful for studying their material properties. For example, 
it is important to know how the bandgap of a semiconductor or a dielectric changes 
within a device (Gu et al., 2009). Dipole transitions in a semiconductor or insulator 
do not occur unless the energy supplied exceeds the direct bandgap, typically 1 eV 
or more. Therefore the energy-loss spectrum should show zero intensity, followed 
by a sharp rise at the bandgap energy Eg, providing a measurement of Eg. 

In practice, there are two problems. First, the zero-loss peak is very strong in thin 
specimens and its tails can extend to several electron volts (Bangert et al., 1997). 
These tails can be minimized by using a TEM fitted with a monochromator and by 
setting the parallel-recording spectrometer to a high enough energy dispersion (low 
eV/channel). Even so, it is usually necessary to remove the high-E tail 
by fitting and subtraction or by Fourier ratio deconvolution (Stöger-Pollach, 2008). 
Park et al. (2009) achieved consistency in measuring the bandgap of SiO2 films of 
different thicknesses by performing linear fits to the intensity rise and its preceding 
background, choosing Eg from the intersection point; see Fig. 5.51. 
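
The intersection-of-linear-fits idea can be sketched as follows. This is an illustrative reading of the procedure rather than the published implementation: the function name, the choice of fitting windows, and the synthetic spectrum (with an onset placed at 9 eV) are all assumptions.

```python
import numpy as np


def bandgap_from_linear_fits(energy, intensity, background_window, rise_window):
    """Estimate Eg as the crossing point of two straight-line fits.

    One line is fitted to the pre-onset background and one to the
    intensity rise; each window is a (low, high) energy pair in eV.
    """
    energy = np.asarray(energy, dtype=float)
    intensity = np.asarray(intensity, dtype=float)

    def fit_line(window):
        low, high = window
        mask = (energy >= low) & (energy <= high)
        slope, intercept = np.polyfit(energy[mask], intensity[mask], 1)
        return slope, intercept

    slope_bg, icpt_bg = fit_line(background_window)
    slope_rise, icpt_rise = fit_line(rise_window)
    # Solve slope_bg * E + icpt_bg = slope_rise * E + icpt_rise for E.
    return (icpt_bg - icpt_rise) / (slope_rise - slope_bg)


if __name__ == "__main__":
    e = np.linspace(5.0, 13.0, 161)
    spectrum = 10.0 + 0.5 * e + np.where(e > 9.0, 20.0 * (e - 9.0), 0.0)
    print(bandgap_from_linear_fits(e, spectrum, (5.0, 8.0), (9.5, 12.0)))
```

The result depends on where the two windows are placed, so the same windows should be used when comparing specimens of different thickness.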

The second problem arises from surface plasmon and Cerenkov modes of energy 
loss, which add peaks in the region of a few electron volts. Both effects involve very 
Fig. 5.51 Bandgap 
small scattering angles, so they can be minimized by using an annular or off-axis 
collection aperture or by subtracting spectra recorded with small and large collection 
apertures (Stöger-Pollach, 2008). Zhang et al. (2008) subtracted spectra recorded 
with two different collection angles in order to measure the dielectric function of 
diamond. 

In fact, radiation losses within the bandgap region (where other loss processes 
are largely absent) could provide useful information about artificial nanostructures, 
since their intensity distribution is directly related to the optical densities of states 
(Garcia de Abajo et al., 2003). The ability of the TEM to examine properties at the 
nanometer level offers the opportunity to examine defects within such structures 
(Cha et al., 2010). 

Rafferty and Brown (1998) pointed out that the low-loss fine structure represents 
a joint density of states multiplied by a matrix element that differs in the case of 
direct and indirect transitions. Assuming no excitonic states, their analysis showed 
that the onset of energy-loss intensity is proportional to (E − Eg)^(1/2) for a direct gap 
and (E − Eg)^(3/2) for an indirect gap. 
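
One simple way to apply such power laws is to linearize them: if the intensity varies as (E − Eg)^n, then the intensity raised to the power 1/n is a straight line in E whose intercept with the energy axis is Eg. The sketch below assumes exponents of 1/2 (direct gap) and 3/2 (indirect gap), a spectrum from which the zero-loss tail has already been removed, and a hypothetical function name; the synthetic gap values are for illustration only.

```python
import numpy as np


def gap_from_onset(energy, intensity, exponent, fit_window):
    """Estimate Eg by fitting intensity**(1/exponent) with a straight line.

    exponent is 0.5 for a direct gap and 1.5 for an indirect gap
    (assumed power laws); fit_window is a (low, high) energy pair in eV
    lying above the onset.
    """
    energy = np.asarray(energy, dtype=float)
    intensity = np.asarray(intensity, dtype=float)
    low, high = fit_window
    mask = (energy >= low) & (energy <= high)
    # Clip small negative values left over from background subtraction.
    linearized = np.clip(intensity[mask], 0.0, None) ** (1.0 / exponent)
    slope, intercept = np.polyfit(energy[mask], linearized, 1)
    return -intercept / slope  # energy at which the fitted line reaches zero


if __name__ == "__main__":
    e = np.linspace(3.3, 6.0, 100)
    direct = 5.0 * (e - 3.2) ** 0.5
    indirect = 2.0 * (e - 3.05) ** 1.5
    print(gap_from_onset(e, direct, 0.5, (3.3, 6.0)))
    print(gap_from_onset(e, indirect, 1.5, (3.3, 6.0)))
```

Comparing the quality of the straight-line fit for the two exponents gives a rough indication of whether the onset looks direct or indirect, although noise and residual zero-loss tail will limit this in real data.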

Cubic GaN is a direct gap semiconductor whose inelastic intensity, after sub- 
tracting the zero-loss tail, fits well to (E − Eg)^(1/2) for E > Eg (Lazar et al., 2003). 
Residual intensity below Eg might arise from indirect transitions in a surface oxide 
layer, as suggested by (E − Eg)^(3/2) fitting; see Fig. 5.52a. The anatase phase of TiO2 
is an indirect-gap material (Eg ≈ 3.05 eV), so Wang et al. (2008b) matched the 
intensity between 3 and 6 eV to (E − Eg)^(3/2); see Fig. 5.52b. In the case of a related 
hydroxylated material H2Ti3O7 (in the form of 8-nm-diameter multiwalled nan- 
otubes), additional intensity occurred below 4 eV, suggesting transitions involving 
defect states introduced by the hydroxyl groups; see Fig. 5.52c. The spectra were 
recorded with an off-axis aperture (q ≈ 1 nm⁻¹) to minimize surface plasmon and 
Cerenkov contributions. 
Fig. 5.52 (a) Inelastic intensity recorded from cubic GaN using a monochromated TEM-EELS 
system. The curves show fitted functions appropriate to direct and indirect electronic transitions; 
from Lazar et al. (2003), copyright Elsevier. (b) Spectra recorded from TiO2 and (c) from a 
H2Ti3O7 nanotube, at various scattering vectors q (in nm⁻¹). Insets show fits to a (E − Eg)^1.5 func- 
tion. Reprinted with permission from Wang et al. (2008b). Copyright 2008, American Institute of 
Physics 


Bandgap maps can be produced by analyzing the low-loss data at each pixel 
(Tsai et al., 2004). However, delocalization of the inelastic scattering limits the spa- 
tial resolution to a few nanometers (Couillard et al., 2007, 2008), as discussed in 
Section 5.5.3. Bosman et al. (2009) concluded that bandgap mapping in the near- 
visible region may not be feasible for embedded layers or particles less than about 
10 nm in dimension. 

Crystalline defects adversely affect the electronic properties of semiconductors. 
High-resolution TEM can be used to investigate the atomic arrangement of indi- 
vidual defects, while EELS gives information on their local electronic structure 
and bonding. Combining this information can lead to an understanding of how the 
atomic structure and physical properties are related. Although strong diffraction in 
a crystalline specimen makes the interpretation of inelastic scattering more compli- 
cated, channeling directs the electrons down atomic columns (if the beam is aligned 
with a crystal axis) and improves the spatial resolution (Loane et al., 1988). The 
STEM uses an annular dark-field (ADF) detector to collect high-angle elastic scat- 
tering, giving a high-resolution image with good atomic number contrast that allows 
the electron beam to be positioned on an atomic column and held there long enough 
to acquire useful spectra if specimen drift and radiation damage are sufficiently low 
(Batson, 1992a, 1993c, 1995; Browning et al., 1993b; Lakner et al., 1992; Kimoto 
et al., 2007; Muller et al., 2008). 

An early use of STEM to investigate semiconductor defects is shown in Fig. 5.53 
(Batson et al., 1986). The large tail of the zero-loss peak was subtracted from the 
low-loss data and the inelastic intensity fitted to (E − Eg)^(1/2). With the STEM probe 
located at a misfit dislocation, the effective energy gap Eg is reduced, the additional 
Fig. 5.53 Change in the intensity of inelastic scattering when a STEM probe is positioned on a 
misfit dislocation in GaAs. From Batson et al. (1986), copyright American Physical Society. 
http://link.aps.org/abstract/PRL/v57/p2729 


low-loss scattering likely arising from electron excitation from filled states (at the 
dislocation core) to the conduction band. Because these states depend on the struc- 
ture of the dislocation, this measurement points to the possibility of using EELS to 
determine the nature of individual dislocations. Takeda et al. (1994) reported that 
line defect self-interstitials in silicon (identified by HREM imaging) give rise to 
an energy-loss peak at 2.5 eV. Tight-binding calculations showed this peak to be 
consistent with the presence of eight-membered rings. 

The local dopant concentration in some semiconductor devices has become high 
enough to be detectable by EELS. Servanton and Pantel (2010) have shown that 
STEM-EELS (based on the As-L23 ionization edge) can map the distribution of 
arsenic dopant in silicon BiCMOS transistors and static RAM. Their sensitivity was 
in the low end of the 10¹⁹ cm⁻³ range, with a spatial resolution about 2 nm; see 
Fig. 5.54. Line scans showed good quantitative agreement with secondary ion mass 
spectrometry. The measurements were made with an incident energy of 120 keV, 
below the knock-on threshold for silicon. 

According to Eq. (4.33), Kramers—Kronig analysis of the low-loss spectrum 
yields an electron concentration, whose value can then be used in Eq. (3.41), 
together with the measured plasmon energy to give an electron effective mass 
(related to bandgap and carrier mobility). In a semiconductor, the main plasmon 
peak represents resonance of the valence electrons, whose effective mass may dif- 
fer from that of the conduction electrons. Nevertheless, Gass et al. (2006a) reported 
Fig. 5.54 (a) TEM image and (b) arsenic concentration map of a 90-nm n-p-n BiCMOS transistor 
(150 x 60 pixels) acquired in 2.2 h by STEM-EELS. (c) Arsenic concentration profile along the 
line CC′. From Servanton and Pantel (2010), copyright Elsevier 


agreement with the cyclotron-resonance mass for GaAs and GaN, then used their 
STEM-EELS system to produce maps of effective mass in InAs quantum dots and 
GaInNAs quantum wells. 
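
The step from electron concentration and plasmon energy to an effective mass can be illustrated numerically. The sketch below assumes that Eq. (3.41) has the usual free-electron form Ep = ħ(ne²/ε0m)^(1/2), so that m* = ne²ħ²/(ε0Ep²); it ignores any background dielectric constant or damping, and the function names and the aluminum-like electron density used as a check are assumptions.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C
EPS0 = 8.8541878128e-12     # vacuum permittivity, F/m
HBAR = 1.054571817e-34      # reduced Planck constant, J s
M0 = 9.1093837015e-31       # electron rest mass, kg


def plasmon_energy_ev(n_per_m3, mass_kg=M0):
    """Free-electron plasmon energy (eV) for electron density n."""
    omega_p = math.sqrt(n_per_m3 * E_CHARGE**2 / (EPS0 * mass_kg))
    return HBAR * omega_p / E_CHARGE


def effective_mass_ratio(n_per_m3, plasmon_ev):
    """Effective mass m*/m0 implied by a measured n and plasmon energy."""
    ep_joule = plasmon_ev * E_CHARGE
    m_eff = n_per_m3 * E_CHARGE**2 * HBAR**2 / (EPS0 * ep_joule**2)
    return m_eff / M0


if __name__ == "__main__":
    n_al = 1.81e29  # approximate valence-electron density of Al, m^-3
    ep = plasmon_energy_ev(n_al)
    print(ep, effective_mass_ratio(n_al, ep))
```

Since m* scales as n/Ep², a small error in the measured plasmon energy is doubled in relative terms in the derived mass, and any error in the Kramers–Kronig electron concentration carries over directly.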

The properties of field-effect transistors depend greatly on the interface between 
the gate dielectric and the semiconductor. There have been several TEM-EELS stud- 
ies of these interfaces. For example, Park and Yang (2009) used Kramers—Kronig 
analysis to determine the dielectric function across a HfO2/Si interface, obtaining 
good agreement with density functional calculations for HfO2 away from the inter- 
face. Gatts et al. (1995) applied neural pattern recognition to the analysis of a series 
of low-loss spectra recorded across a Si/SiO2 interface. First-difference spectra were 
fitted to Si and SiO2 standards, with a third component representing the interface 
loss. 

To lower gate voltage and power dissipation, the thickness of the FET gate insula- 
tor has been reduced to values approaching 1 nm. In the case of SiO2, this thickness 
corresponds to only five Si atoms and the properties of the oxide might be expected 
to depart from those of the bulk solid. High-resolution STEM of cross-sectional 
specimens, combined with core-loss EELS, provides a way of investigating these 
effects. Figure 5.55 shows how the background-subtracted oxygen K-edge changed 
when a STEM probe approached the Si interface from a native oxide. The threshold 
shifted downward by 3 eV, indicating a reduced bandgap, and the sharp threshold 
peak disappeared. Since this peak arises from strong backscattering of the ejected 
core electron from nearest-neighbor O atoms, its absence near the interface indi- 
cates a silicon-rich environment: a sub-oxide SiOx where x < 2. Similar results were 
obtained for the thermally grown oxide used in device fabrication. From analysis 
Fig. 5.55 Oxygen K-loss 
of the two ELNES components, the sub-oxide was estimated to have a width of 
0.75 nm, compared to a total thermal oxide width of 1.6 nm (Muller et al., 1999). 

EFTEM imaging can provide a relatively rapid way of obtaining elemental maps 
of semiconductor devices, revealing their structure more clearly than a conventional 
TEM image; see Fig. 5.56. Botton and Phaneuf (1999) used GIF-EFTEM imag- 
ing with a narrow (4-eV) energy window to distinguish between oxide and nitride 
regions in a DRAM transistor, exploiting the larger chemical shift of silicon oxide 
compared to the nitride. Energy-filtered TEM has also been used to display elemen- 
tal distributions and composition gradients in semiconductor multilayers (Jager and 
Mayer, 1995; Liu et al., 1999). 


Fig. 5.56 (a) Low-magnification zero-loss image of a trench capacitor (part of a 64-MB DRAM 
chip). (b) Elemental map showing the distribution of silicon, nitrogen, and oxygen in this region. 
From Botton and Phaneuf (1999), copyright Elsevier 
5.7.2 Ceramics and High-Temperature Superconductors 


The performance of many technological materials is dependent on the properties 
of their internal surfaces (Brydson et al., 1995). In the case of Si3N4 structural 
ceramics, the mechanical and thermal properties are controlled by the atomic and 
electronic structure of the crystal/amorphous interface, which may undergo inter- 
atomic mixing and partial ordering involving heavy cations. Walkosz et al. (2010) 
used EELS to obtain information about the position of light atoms at the β-Si3N4– 
SiO2 interface. Figure 5.57a shows an atomic-resolution bright-field image that 
reveals the presence of short-range ordering between the crystalline β-Si3N4 and 
amorphous SiO2. To identify the composition along and across the interface, atomi- 
cally resolved EELS was carried out at 80 kV (avoiding radiation damage) by raster 
scanning an aberration-corrected probe while collecting a spectrum (in 0.05 s) at 
each pixel. Figure 5.57d, e shows the Si L23 edges taken from six different posi- 
tions, identified in the simultaneously acquired Z-contrast image (Fig. 5.57f) and 
Fig. 5.57 (a) A 300-kV bright-field image (contrast inverted) of a β-Si3N4–SiO2 interface; (b) 
and (c) are intensity profiles at positions P1 and P2, respectively; (d) and (e) show the Si L-edge 
acquired at the six different positions shown in the Z-contrast image (f), with L-edges of bulk Si3N4 
and SiO2 for comparison. (g) shows O/N ratios at the six positions, normalized with respect to the 
maximum (position 2). Dotted lines in (a) and (d) mark the interface between the Si3N4 grain and 
the amorphous SiO2 film; the scale bars are 0.5 nm in length. Figure adapted from Walkosz et al. 
(2010), copyright American Physical Society. http://link.aps.org/abstract/PRB/v82/p081412 




corresponding to the Si sites in the bulk Si3N4. Silicon signals at all six positions 
displayed both Si-N and Si-O bonding features (peaks a, b, and c of the Si3N4 and 
SiO2 reference spectra) but those at positions 1, 2, 3, and 4 reveal stronger Si-O 
bonding since the peaks at 105 eV (labeled b) are more pronounced. The Si signals at 
positions 4 and 6, representing the two ends of the terminating Si3N4 structure, have 
slightly different features, indicating distinct bonding characteristics. To investigate 
further, O and N K-edges were acquired simultaneously with the Si L23 edge, and by 
integrating these signals over a 40-eV window, elemental O/N ratios were computed, 
as shown in Fig. 5.57g. The ratios are different at the two ends of the terminating 
Si3N4 structures (positions 4 and 6), suggesting a compositional asymmetry at the 
Si3N4 surface and implying site-specific intermixing across the interface. 

Klie and Zhu (2005) and Klie et al. (2008) have reviewed EELS and atomic 
column STEM imaging of ceramics, including their studies of dislocations present at 
8° tilt boundaries in SrTiO3. At a dislocation core, the Ti L-edge integral was found 
to be 21% higher than in the bulk; its threshold was shifted 0.8 eV lower in energy, 
indicating a Ti valency of 3.6 ± 0.2; the t2g peaks were suppressed, consistent with 
a Ti valency below 4. The oxygen-K ELNES structure was also different at the core, 
suggesting an excess negative charge. 

Some ceramics have applications as thermoelectric materials. Klie and Qiao 
(2010) have shown how EELS and annular bright-field STEM can be useful for 
determining the effects of structural disorder, strain, and charge transfer on the ther- 
moelectric properties of the layered oxide material Ca3Co4O9. STEM spectroscopic 
imaging has been used to investigate atomic-scale interdiffusion at oxide interfaces 
(Fitting Kourkoutis et al., 2010). 

Following their discovery in 1987, cuprate superconductors were extensively 
studied by TEM-EELS, including yttrium barium cuprate (YBa2Cu3O7−δ, abbre- 
viated as YBCO). These ceramics contain CuO2 planes perpendicular to the c 
direction, with oxygen and metal atoms in between. Superconductivity involves 
Cooper pairs of valence band holes and is associated with the CuO2 planes, where 
Cu-3d and O-2p states lie close to the Fermi level. Taking advantage of the dipole 
selection rule (Section 3.7.2), the unoccupied part of these states can be investigated 
by examining fine structure of the copper L23 and oxygen K-edges. 

Figure 5.58a shows the onset of the oxygen K-edge recorded from 
YBa2Cu3O7−δ. For small oxygen deficiency (δ ≈ 0.2), two pre-edge features are 
visible: a shoulder around 535 eV, which represents transitions to unoccupied Cu- 
3d-states (upper Hubbard band), and a peak around 528 eV representing transitions 
to O-2p states that give rise to holes in the valence band. As the oxygen deficiency 
δ increases, the 528-eV peak falls in intensity, indicating a decrease in hole con- 
centration and a reduction in the superconducting critical temperature. For δ > 0.6, 
the prepeak is absent, corresponding to zero hole concentration and a material that 
behaves as an electrical insulator at all temperatures. 

By appropriate choice of the scattering angle and specimen orientation, it is pos- 
sible to select the direction of the momentum transfer q that contributes to the 
energy-loss spectrum and observe significant differences in pre-edge structure; see 
Fig. 5.58b. These observations were used to determine final-state symmetries in 
Fig. 5.58 (a) Onset of the oxygen K-edge of YBCO at four values of the oxygen deficiency δ 
(Fink, 1989). (b) Oxygen K-edge recorded at scattering angles such that the momentum transfer 
was either parallel or perpendicular to the c-axis. From Fink et al. (1994), copyright Elsevier 


several oxide superconductors, information that proved useful in the comparison 
of different models of electrical conduction (Fink et al., 1994). The q-dependence 
(dispersion) of the plasmon peak has been measured for Bi2Sr2CaCu2O8 and inter- 
preted in terms of the multilayered structure of the material (Longe and Bose, 1993). 
Using a field-emission TEM to examine Ba1−xKxBiO3, Wang et al. (1993) identi- 
fied a free-carrier plasmon around 2 eV that abruptly increased in strength at the 
semiconductor/metal transition (x > 0.3). The dielectric function of YBCO shown 
in Fig. 5.59 was derived by Kramers—Kronig analysis of EELS data and has been 
discussed in terms of band structure (Yuan et al., 1988). 

The superconducting properties of these ceramics depend strongly on the pres- 
ence of defects, such as grain boundaries in a polycrystalline material. Making use 
of the high spatial resolution of TEM-EELS, the oxygen-edge prepeak can be used 
as a sensitive measure of local oxygen concentration. From EELS measurements 
made using a 2-nm electron probe, Zhu et al. (1993) concluded that grain bound- 
aries in fully oxygenated YBCO fall into two categories, with different structural 
misorientation between the grains: those in which the 529-eV peak retains its inten- 
sity across the grain boundary and those in which the intensity falls practically to 
zero. The implication is that some boundaries are fully oxygenated and would trans- 
mit a superconducting current at temperatures below 90 K, while others contain an 
oxygen-depleted region (width 10 nm or less) and may act as weak links. Similar 
conclusions have been reached on the basis of EELS combined with Z-contrast 
STEM imaging (Browning et al., 1993a). From analysis of the oxygen K-edge struc- 
ture, Browning et al. (1999) established the presence of a charge depletion layer at 
the grain boundaries of YBa2Cu3O7−δ and (Bi/Pb)2Sr2Ca2Cu3O10. They found that 
Fig. 5.59 Energy-loss function (solid curve) together with real and imaginary components of the 
permittivity ε of YBCO superconductor, derived from low-loss EELS. Peaks A and B are attributed 
to valence electron plasmon excitation and peak C to yttrium 4p—d transitions (N23 edge). From 
Yuan et al. (1988), copyright IOP Publishing. http://iopscience.iop.org/0022-3719/21/3/008 


the width increased with misorientation angle and concluded that the critical current 
is limited by tunneling across this depletion zone. 

From measurements of the 529-eV peak, Dravid et al. (1993) showed that the 
oxygen content in YBa2Cu3O7−δ may vary along a grain boundary plane (δ fluc- 
tuating between 0.2 and 0.4 over distances of the order of 10 nm) and proposed a 
modification to the Dayem bridge model for grain boundary structure. Browning 
et al. (1991a) applied the same technique to reveal variations in oxygen content 
within a grain, with an estimated accuracy of 2%. EELS has also been used to detect 
the presence of carbon at grain boundaries of densified YBCO; examination of the 
carbon and oxygen K-edge structure indicated the presence of BaCO3 (Batson et al., 
1989). 

Tl0.5Pb0.5Ca1−xYxSr2Cu2O7−δ is another high-temperature superconductor that 
has been examined by EELS (Yuan et al., 1991). Its oxygen K-edge shows a prepeak 
at 529 eV, believed to represent transitions to conduction band states and not an 
indication of superconductivity. However, a shoulder on the low-energy side of the 
peak (attributed to transitions to hole states near the Fermi level) appeared to be 
characteristic of the superconducting (x < 0.4) material. Core-loss measurements on 
Y1-xCaxSr2Cu2GaO7 (Dravid and Zhang, 1992) also found two pre-edge features: 
a broad peak (dependent on Ca doping) centered around 528.2 eV, associated with 
normal conductivity and attributed to holes on oxygen sites that are not on CuO2 
planes, and a smaller peak around 527.1 eV, probably associated with holes on CuO2 
planes and superconductivity. 

Attempts were made to fabricate Josephson junctions by depositing multilayer 
structures of oxide superconductors. However, the c-axis tends to lie perpendicular 
to the film plane and the coherence length is very short (<1 nm) in this direction, 
placing extreme limits on the sharpness of the junction. If a film could be 
depleted in oxygen by means of an electron probe of sub-nanometer dimensions, the 
insulating gap would form a weak-link structure. Unfortunately, fully oxygenated 
YBCO appears to retain its oxygen content even after high electron dose, although 
an electron beam has been used to cut nanometer-wide channels in an amorphous 
phase of similar composition (Humphreys et al., 1988; Devenish et al., 1989). 

MgB2 was discovered in 2001 to be superconducting below 39 K. Oxygen doping 
not only provides flux-pinning centers in the form of nano-precipitates inside the 
MgB2 grains, but also leads to MgO or BOx precipitates at the grain boundaries. 
By recording spectra for different crystal orientations and collection angles, Klie 
et al. (2003) showed that a prepeak at the boron K-edge largely represents the pxy 
states that are believed to play an important role in superconductivity. There is also 
interest in iron-pnictide materials as high-Tc superconductors and in the case of 
NdFeAsO, Idrobo et al. (2010) report that the Fe L3/L2 and Nd M5/M4 ratios vary 
with crystallographic orientation and specimen temperature, these changes being 
correlated with changes in electronic structure. 


5.7.3 Carbon-Based Materials 


Carbon is a uniquely important element and of practical interest as a result of the 
development of hard diamond-like coatings and the discovery of fullerenes (C60, 
etc.) in 1985, carbon nanotubes in 1991, and more recently single-layer graphene 
and its derivatives. 

Graphene has low-loss and core-loss spectra as shown in Fig. 5.60. When the 
thickness of a graphite specimen is reduced, the bulk plasmon (7 and 27 eV) peaks 
eventually become redshifted, to about 5 and 14.5 eV in the case of a single graphene 
layer. These shifts are in substantial agreement with calculations made using local 
density functional code (Eberlein et al., 2008). They might also be seen as a change 
from bulk to surface plasmons, bearing in mind that the two surface modes are 
highly coupled and that dispersion makes the peak q-dependent (Section 3.3.5). 
Calculated energy-loss functions for graphene and graphite are shown in Fig. 5.61. 
The out-of-plane (q||c) mode approaches zero in single-layer graphene, whose 
π-plasmon exhibits a linear dispersion, from 5.1 eV at q = 1 nm⁻¹ to 6.7 eV at q = 
4 nm⁻¹ (Lu et al., 2009). Linear dispersion is also observed for two-layer material, 
but the dispersion is closer to quadratic for three layers. Single-layer material can be 
identified from the fact that its electron diffraction pattern has no higher order Laue zones and 
varies little with specimen orientation (Wu et al., 2010). Graphene becomes dam- 
aged as a result of knock-on processes at incident electron energies above about 
60 keV. 

Carbon nanotubes are rolled-up graphene sheets capped with fullerene-like 
end structures and exist in single-wall (SWCNT) and multiwall (MWCNT) form. 
They display nondispersive excitations, whose energies are related to the elec- 
tronic density of states and dispersive excitations related to a collective excitation 
of the π-electrons polarized along the nanotube axis. Despite the small dimensions, 
dielectric theory appears to apply approximately; the dielectric function has 
Fig. 5.60 (a) Low-loss spectra of single-, double-, and five-layer unsupported graphene, recorded 
using 100-keV electrons (Gass et al., 2008). (b) Carbon K-edge recorded using 200-keV electrons 
from single-layer freestanding graphene. The K-edge of a double layer appeared similar. 
Fig. 5.61 Energy-loss function calculated for a single layer and multilayers of graphene with 
q||c (top) and q⊥c (bottom). The x-axis represents energy loss in eV and the y-axis is in arbitrary 
units. From Bangert et al. (2008), copyright © 2009 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim 
been calculated from Kramers-Kronig analysis of EELS data (Pichler et al., 1998). 
The SWCNT π-resonance energy (15 eV) is close to that of single-layer graphene 
and shows linear dispersion, whereas interband transitions below 4 eV show no 
dispersion (Pichler et al., 1998). As the wall thickness increases, the dielectric properties 
become closer to those of graphite (Stöckli et al., 1997). A MWCNT peak 
at 19 eV has been identified as a radial surface plasmon mode (Stephan et al., 
2002). Especially for larger diameter tubes, the plasmon energy depends more on 
the number of graphene layers than on the overall diameter (Upton et al., 2009); see 
Fig. 5.62. EELS fine structure below 5 eV shows agreement with optical data and 
DOS calculations (Sato et al., 2008b). Zobelli et al. (2007) have shown that carbon 
and boron nitride nanotubes become damaged by a knock-on mechanism at incident 
electron energies above 80 keV. Nanotube bundles have also been investigated by 
EELS (Reed and Sarikaya, 2001). 

Fullerenes were discovered in soot condensed from carbon vapor and found to 
be molecules comprised of graphite-like sheets bent into spherical or ellipsoidal 
shapes. The solid form (fullerite) can be extracted with benzene or by subliming 
the deposit onto a substrate to create a thin film. The energy-loss spectrum of C60 
fullerite shows a main (σ+π) plasmon peak at 25.5 eV, a π-resonance peak at 
6.4 eV, and several subsidiary peaks (Hansen et al., 1991; Kuzuo et al., 1991). The 
peaks of other fullerites, such as C70, C76, and C84, are shifted in energy and their 
fine structure is different (Terauchi et al., 1994; Kuzuo et al., 1994). The carbon 
K-edge structures are also distinguishable and unlike that of graphite (Fig. 5.63), 
so EELS is useful for identifying small volumes of these materials. Fullerenes are 
damaged by electron doses above about 100 C/cm², apparently by radiolysis rather 
than knock-on displacement (Egerton and Takeuchi, 1999). They can be polymerized, 
for example, with UV light, creating materials with a smaller bandgap and a 
reduced K-edge π* peak (Terauchi et al., 2005). Anisotropic dielectric theory has 
Fig. 5.62 Low-loss spectra of carbon nanotubes, calculated from dielectric theory for external 
radius R = 20 nm and six values of internal radius r, from 1 to 19 nm. From Stephan et al. 
(2002), copyright American Physical Society. http://link. 
Fig. 5.63 (a) Low-loss and (b) K-loss spectra of fullerites, compared with graphite. From Kuzuo 
et al. (1994), copyright American Physical Society. http://link.aps.org/abstract/PRB/v49/p5054 
been developed (Stöckli et al., 1998) to describe plasmon excitation in multiwalled 
carbon spheres (carbon onions). 

Diamond combines high hardness, thermal conductivity, and refractive index 
(2.4) with very low electrical conductivity and transparency to visible light. Natural 
diamond is classified on the basis of infrared spectra: unlike their type II cousins, 
type I diamonds contain an appreciable amount of nitrogen, either segregated 
(type Ia) or dispersed (type Ib). In the former case, TEM reveals the presence of 
10-100 nm platelets lying on {100} planes, while EELS measurements (Bruley, 
1992; Fallon et al., 1995) have shown that the nitrogen content of the platelets can 
vary between 0.08 and 0.47 monolayer. The nitrogen and carbon K-edge ELNES 
were similar, suggesting that N and C atoms have the same local environment 
and that N atoms are present as isolated substitutional impurities; see Fig. 5.64a. 
Fig. 5.64 (a) Nitrogen K-edge recorded at a platelet, scaled to match the carbon K-edge from 
nearby diamond. The shapes are basically similar but the K-edge appears to have higher intensity 
within 20-30 eV of the threshold (Fallon et al., 1995). (b) Nitrogen K-edge recorded from a region 
of diamond containing a voidite. From Luyten et al. (1994). Copyright Taylor and Francis 
Additional scattering around 5 eV at the platelets has been interpreted in terms of 
localized states arising from partial dislocations (Bursill et al., 1981). 

A small rise in intensity occurring about 5 eV before the carbon K-edge (e.g., 
Fig. 5.37) is more prominent in thin specimens and may be associated with surface 
states within the bandgap or π* levels of a graphitic surface layer. Using spatial-difference 
EELS, Bruley and Batson (1989) detected additional intensity in the 
pre-edge region when the electron beam was placed close to a dislocation, perhaps 
indicating excitation to defect or impurity states. The presence of a monolayer of 
oxygen at the {111} free surface of diamond has been deduced from the observa- 
tion of an oxygen K-edge in reflection mode energy-loss spectra (Wang and Bentley, 
1992). 

Some natural diamonds contain octahedral-faceted inclusions, a few nanometers 
in size, known as voidites. Bruley and Brown (1989) showed that some voidites 
contain nitrogen, a sharp peak at the ionization threshold indicating N2 rather than 
NH3 (previously proposed). The nitrogen concentration appeared to be independent 
of voidite size, its average value being about half the carbon concentration in dia- 
mond. Despite the high pressure involved, the nitrogen did not appear to be metallic, 
as evidenced by the lack of additional intensity below 5 eV. No diffraction spots 
were observed, suggesting that nitrogen is present in an amorphous phase. Luyten 
et al. (1994), however, found moiré fringes and tetragonal-phase diffraction spots at 
voidites that gave a strong nitrogen signal; see Fig. 5.64b. Such differences might 
result from different geological conditions during the diamond formation. 

Very small (0.5-10 nm) crystals of diamond have been found in chondritic meteorites. 
Their carbon K-edge (Fig. 5.65c) showed a prominent feature just below the 
main absorption threshold, characteristic of transitions to π* states in sp²-bonded 
carbon and indicating that graphitic or amorphous carbon is present at the surface 
of each grain. This observation supported the proposal that the diamond was formed 
by pressure conversion of graphite during grain-grain collisions in interstellar space 
(Blake et al., 1988). 

Thin films of diamond grown by chemical vapor deposition (CVD) onto single- 
crystal silicon substrates are usually found to be polycrystalline. From the presence 
of a π* peak at 285 eV, Fallon and Brown (1993) concluded that amorphous carbon 
is present at grain boundaries and at the free surface. From its plasmon energy 
and low estimated sp³ fraction, this carbon is believed to be nonhydrogenated; see 
Fig. 5.66. A lack of epitaxy may result from the presence of a sub-nanometer layer 
of amorphous carbon, visible at the substrate/film interface in STEM images of 
cross-sectional specimens formed from energy losses just before and just after the 
diamond K-threshold (Muller et al., 1993). However, some deposition conditions 
give areas with oriented growth and absence of an interfacial layer (Tzou et al., 
1994). 

A metastable hexagonal form of diamond (lonsdaleite) can be synthesized by 
shock-wave conversion of graphite. Its plasmon peak occurs at slightly lower 
energy (32.4 eV) than cubic diamond, possibly due to the presence of lattice 
defects (Schmid, 1995). Moreover, its plasmon peak is more symmetrical than cubic 
diamond, which has a “shoulder” around 23 eV arising from interband transitions. 
Fig. 5.65 Carbon K-edges of (a) synthetic low-pressure CVD diamond, (b) synthetic high-pressure 
diamond, and (c) Cδ component of the Allende CV3 meteorite. The arrow marks pre-edge 
structure due to 1s → π* transitions, characteristic of sp² bonding. From Blake et al. (1988), 
copyright Nature Publishing Group 


[Figure 5.66: percentage of sp³ bonding (0-100) plotted against plasmon energy (21-35 eV) for different forms of carbon; labeled points include defect graphite, C60, and graphite.] 

Fig. 5.66 Percentage of sp³ bonding (measured from the C-K π* intensity) as a function of 
plasmon energy (main peak in the low-loss spectrum) for different forms of carbon. The solid line 
indicates the general trend for nonhydrogenated carbon films. The presence of hydrogen lowers the 
film density and plasmon energy (dotted line). The large filled circle at the bottom left represents 
grain-boundary amorphous carbon. From Fallon and Brown (1993), copyright Elsevier 
5.7.3.1 Measurement of Bonding Type 


Both the low-loss and K-edge spectra of diamond and graphite differ substantially, 
enabling EELS to provide a measure of the relative sp³ (diamond-like) and sp² 
(graphitic) bonding in various forms of carbon. 

Transparent films of tetrahedral-amorphous carbon (ta-C, also known as hard 
carbon or amorphous diamond-like carbon, a-DLC) can be made by laser ablation 
or by creating a vacuum arc on a graphite cathode, with magnetic filtering to ensure 
that 20-2000 eV ions are selected from the plasma. They have high hardness and a 
density about 80% of the diamond value (3.52). Berger et al. (1988) determined the 
type of bonding in these films in terms of the parameter 


$$R = \frac{I_K(\pi^*)}{I_K(\Delta)} \, \frac{I_l(\Delta)}{I_0} \tag{5.33}$$


where I_K(π*) is the K-shell intensity in the π* peak and I_K(Δ) represents K-loss 
intensity integrated over an energy range Δ (at least 50 eV) starting at the threshold. 
The factor I_l(Δ)/I_0 (ratio of low-loss and zero-loss intensities) corrects for plural 
(K-shell + plasmon) scattering present in I_K(Δ) but not in I_K(π*); this factor should 
be omitted for spectra that have been deconvolved to remove plural scattering. The 
fraction of graphitic bonding is then evaluated as f = R/Rg, where Rg is the value 
of R measured from the spectrum recorded from polycrystalline graphitized carbon. 
Papworth et al. (2000) used C60 as their standard because its K-edge is less dependent 
on specimen orientation. By fitting to peaks at 285, 287, and 293 eV, they 
concluded that their evaporated amorphous carbon was 99% sp²-bonded. 
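
The ratio method of Eq. (5.33) can be sketched numerically as follows. This is only an illustrative outline: the function names, the choice of a π* window that starts at the threshold, and the assumption of uniform energy channels are not taken from the cited work, and the K-edge is assumed to be background-subtracted already.

```python
import numpy as np


def window_sum(energy, counts, start, stop):
    """Sum counts in channels with start <= E < stop (uniform channel width assumed)."""
    mask = (energy >= start) & (energy < stop)
    return float(counts[mask].sum())


def bonding_ratio(energy, k_edge, threshold, pi_width, delta, plural_factor=1.0):
    """R of Eq. (5.33) from a background-subtracted K-edge.

    plural_factor stands for I_l(Delta)/I_0; leave it at 1.0 for spectra that
    have already been deconvolved to remove plural scattering.
    The pi* window [threshold, threshold + pi_width) is an assumed choice.
    """
    i_pi = window_sum(energy, k_edge, threshold, threshold + pi_width)
    i_delta = window_sum(energy, k_edge, threshold, threshold + delta)
    return (i_pi / i_delta) * plural_factor


def graphitic_fraction(r_sample, r_standard):
    """f = R / Rg, with Rg measured on the graphitic standard."""
    return r_sample / r_standard
```

In use, the same windows would be applied to the unknown and to the standard, so that systematic errors in the window placement partly cancel in the ratio f.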

A variation on the above procedure is to measure the intensity within a narrow 
window Δ1 centered around the π* peak and over a broader window Δ2 under the 
σ* peak, the fraction p of sp² bonds being given by 


$$\frac{I_\pi(\Delta_1)}{I_\sigma(\Delta_2)} = k\,\frac{p}{4-p} \tag{5.34}$$


where the value of k is again obtained by comparison with graphite (p = 1). With 
Δ1 = 2 eV and Δ2 = 8 eV, this procedure was found to yield a variability of about 
5% and an absolute accuracy of ±13% in the sp² fraction when applied to a large 
number of ta-C films (Bruley et al., 1995). 
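
Equation (5.34) can be inverted for p once k is known. Writing r for the measured intensity ratio, r(4 − p) = kp gives p = 4r/(k + r), and setting p = 1 for graphite gives k = 3r for the graphite ratio. A minimal sketch of that algebra (the function names are illustrative, not from the cited work):

```python
def calibrate_k(ratio_graphite):
    """k from the graphite standard: with p = 1, Eq. (5.34) gives ratio = k / 3."""
    return 3.0 * ratio_graphite


def sp2_fraction_two_window(ratio, k):
    """Invert Eq. (5.34): ratio = k p / (4 - p)  =>  p = 4 ratio / (k + ratio)."""
    return 4.0 * ratio / (k + ratio)
```

The 4 − p denominator reflects the counting of one π and three σ electrons per sp² atom against four σ electrons per sp³ atom.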

For ta-C, Eq. (5.33) predicts that 15% of the bonding is sp², the remainder being 
sp³ (assuming sp¹ bonding to be absent). This diamond-like bonding may occur 
because a graphitic surface layer is subjected to high compressive stress or high local 
pressure by the energetic incident ions. The ta-C films contain no hydrogen, unlike 
amorphous silicon and germanium films in which hydrogen is required to stabilize 
the structure. The low-loss spectrum contains a weak peak at 6 eV (characteristic 
of π electrons) and energy-selected imaging at this energy has shown that ta-C can 
contain a low density of disk-like inclusions that are mostly sp² bonded (Yuan et al., 
1992). 
By adjusting the ion energy selected by the magnetic filtering, the sp³ fraction 
and film density of ta-C (measured from the plasmon energy or by RBS) can be varied. 
When the sp³ fraction is plotted against plasmon energy, the data lie close to a straight line 
with bulk diamond as an extrapolation; see Fig. 5.66. Amorphous carbon produced 
from an arc discharge between pointed carbon rods (a common method of making 
TEM support films) also lies on this line, its sp³ fraction being typically 8%. 
A variation of Fig. 5.66 is to plot sp² fraction against the inverse square of the plasmon 
energy, in which case straight line behavior accords with a free-electron model 
(Bruley et al., 1995). Plasmon-loss and π*-peak measurements have been used to 
investigate the variation of film density and sp³ fraction with deposition method 
(e-beam evaporation, ion sputtering, and laser ablation) and with temperature and 
thermal conductivity of the substrate (Cuomo et al., 1991). 

By introducing N2 into the cathodic arc, up to 30% of nitrogen can be incorporated 
into ta-C, with a gradual loss of sp³ bonding and reduction in compressive 
stress with increasing nitrogen content. The fine structures of the nitrogen and carbon 
K-edges are similar at all compositions, suggesting substitutional replacement 
of carbon by nitrogen (Davis et al., 1994). 

Amorphous carbon containing hydrogen (a-C:H), in the form of a hard and trans- 
parent film, is produced by plasma deposition from a hydrocarbon. Fink et al. (1983) 
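used a Bethe sum method, based on Eq. (4.32), to measure the sp² fraction in these 
materials as f = R/Rg, where 


$$R = \frac{n_{\mathrm{eff}}(\delta)}{n_{\mathrm{eff}}(\Delta)} = \frac{\int_0^{\delta} E\,\mathrm{Im}(-1/\varepsilon)\,dE}{\int_0^{\Delta} E\,\mathrm{Im}(-1/\varepsilon)\,dE} \tag{5.35}$$
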
The energy-loss function Im[−1/ε(E)] is obtained from Kramers-Kronig analysis 
(Section 4.2); the energy Δ was taken as 40 eV and δ as the intensity minimum 
(about 8 eV) between the π-resonance peak (about 6 eV) and the (σ + π) resonance 
(around 24 eV). This procedure assumes that inelastic intensity below E = δ 
corresponds entirely to excitation of carbon π-electrons (ignoring any contribution 
from hydrogen), by analogy with graphite where Eq. (5.35) gives Rg = 0.25, consistent 
with one π electron out of four valence electrons per atom (Taft and Philipp, 
1965). 
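
As a rough illustration of the Bethe-sum ratio described above, the two energy-weighted integrals of the loss function can be evaluated numerically. The sketch below assumes that Im(−1/ε) is already available on an increasing energy grid (for example from Kramers-Kronig analysis); the function names and the trapezoidal rule are illustrative choices, with the 8 eV and 40 eV limits taken from the text.

```python
import numpy as np


def trapezoid(y, x):
    """Trapezoidal integral of y(x) on an increasing grid."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))


def bethe_sum_ratio(energy, loss_function, delta_small=8.0, delta_large=40.0):
    """Ratio of energy-weighted loss-function integrals up to delta_small and delta_large."""
    weighted = energy * loss_function
    low = energy <= delta_small
    full = energy <= delta_large
    return trapezoid(weighted[low], energy[low]) / trapezoid(weighted[full], energy[full])
```

For graphite the text quotes Rg = 0.25, so a measured R ≈ 0.08 corresponds to f = R/Rg ≈ 0.32, in line with the one-third graphitic bonding reported for hydrogenated films.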

For hydrogenated films, Fink et al. (1983) obtained R ≈ 0.08, implying that one-third 
of the bonding is graphitic. Upon annealing at 650°C, the graphitic fraction 
increased to two-thirds and the (σ+π) plasmon energy decreased by 2 eV. Since 
annealing removes hydrogen from the films, this decrease in plasmon energy may 
be partly due to loss of the electrons previously contributed by hydrogen atoms. 
Upon annealing to 1000°C, the plasmon energy increased, indicating an increase 
in density (Fink, 1989). Measurement of the π-peak energy versus scattering angle 
gave a dispersion coefficient close to zero, implying that the π electrons undergo 
single electron rather than collective excitation. The π-electron states are therefore 
localized in a-C:H, similar to states within the "mobility gap" of amorphous semiconductors. 
Upon removal of hydrogen by annealing, the π peak became dispersive, 
indicating formation of a band of delocalized states. Fink (1989) interpreted these 
results in terms of a model for a-C:H in which π-bonded clusters are surrounded by an 
sp³ matrix. 

Daniels et al. (2007) used EELS and x-ray diffraction to study the heat treatment 
of petroleum pitch at temperatures up to 2730°C. The volume plasmon (σ+π) peak 
was found to give a good measure of graphitic character; the K-edge σ* fine structure 
provided an indicator of the degree of longer range order. 

McKenzie et al. (1986) studied fine structure of the carbon K- and Si L-edges 
in a-Si1-xCx:H alloys as a function of composition and used chemical shifts to 
derive information about compositional and structural disorder. Amorphous carbon/nitrogen 
alloys (CNx, where x < 0.8) have also been studied; the carbon and 
nitrogen K-edges provide a convenient measurement of film composition, while 
the presence of a strong π* peak indicates that the material remains primarily sp² 
bonded (Chen et al., 1993). However, the relative strength of the 287-eV peak falls 
with increasing nitrogen content (Papworth et al., 2000; Yuan and Brown, 2000). 

Ferrari et al. (2000) examined a variety of amorphous carbon films, some 
containing hydrogen or nitrogen, and deduced physical density from both the 
x-ray reflectivity and the volume plasmon energy Ep, assuming that C, N, and H 
atoms contribute 4, 5, and 1 electrons, respectively. By comparison with Ep = 
(h/2π)(ne²/ε0m)^(1/2), they concluded that the effective mass is m = 0.87m0 in 
carbon systems. 
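
The free-electron relation between density and plasmon energy used above can be checked with a short calculation. The sketch below is illustrative only: the function names are hypothetical, the diamond density is taken as 3.52 g/cm³ (the unit is assumed for the value quoted earlier), and the physical constants are standard SI values.

```python
import math

HBAR_EV_S = 6.582119569e-16    # reduced Planck constant (eV s)
E_CHARGE = 1.602176634e-19     # elementary charge (C)
EPS0 = 8.8541878128e-12        # vacuum permittivity (F/m)
M0 = 9.1093837015e-31          # electron rest mass (kg)
AVOGADRO = 6.02214076e23       # atoms per mole


def valence_electron_density(density_g_cm3, mean_molar_mass, mean_valence):
    """Valence electrons per m^3 from mass density, mean molar mass (g/mol), and electrons per atom."""
    atoms_per_m3 = density_g_cm3 * 1.0e6 / mean_molar_mass * AVOGADRO
    return atoms_per_m3 * mean_valence


def plasmon_energy_ev(n_per_m3, mass_ratio=0.87):
    """Ep = hbar * sqrt(n e^2 / (eps0 m)), with m = mass_ratio * m0."""
    omega_p = math.sqrt(n_per_m3 * E_CHARGE ** 2 / (EPS0 * mass_ratio * M0))
    return HBAR_EV_S * omega_p


# Diamond: four valence electrons per carbon atom.
n_diamond = valence_electron_density(3.52, 12.011, 4.0)
ep_free = plasmon_energy_ev(n_diamond, mass_ratio=1.0)
ep_effective = plasmon_energy_ev(n_diamond, mass_ratio=0.87)
```

With these inputs the free-electron mass gives roughly 31 eV, while m = 0.87m0 raises the value to roughly 33 eV. Inverting the same relation for n is how a measured Ep would be converted to a density estimate.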

Braun et al. (2005) have pointed out that TEM-EELS K-edges consistently show 
less fine structure than x-ray absorption spectra (at least for diesel soot), which they 
attribute to radiation damage caused by the electrons. 


5.7.4 Polymers and Biological Specimens 


Microtomed thin sections of polymers and biological tissue present problems for 
analytical TEM because of their radiation sensitivity and low image contrast. 
Energy-filtered imaging can be used to increase the contrast or to examine thicker 
sections, as discussed in Section 5.3. From spectrum image data, Hunt et al. (1995) 
formed “chemical” maps at 7 eV loss (characteristic of double bonds) showing 
polystyrene-rich regions in unstained sections of a polyethylene blend. More et al. 
(1991) used parallel recording EELS to detect sulfur in a 0.5-μm² area of polyether 
sulfone (PES), for which the maximum safe dose (deduced from decay of the 6-eV 
peak in a time-resolved series of spectra) was estimated as 0.24 C/cm². Rao et al. 
(1993) detected a 15% increase in carbon concentration in 40-nm-sized regions of 
ion-implanted polymers by using K-edges together with low-loss spectra (to allow 
for differences in local thickness). Kim et al. (2008) used differences in low-loss 
fine structure (revealed by energy-filtered cryo-TEM imaging) to study the copoly- 
merization of PDMS/acrylate mixtures; see also the review of Libera and Egerton 
(2010). 

Differences in carbon K-edge π*-peak energy were used by Ade et al. (1992) 
to image polymer blends and chromosomes in a scanning transmission x-ray 
microscope (STXM) with 55-nm spatial resolution. Similar ELNES imaging could 
be performed in an energy-selecting TEM with greater spatial resolution but with 
higher radiation dose. Du Chesne (1999) provides various examples of zero-loss, 
low-loss, and core-loss imaging of polymers. 

Biological TEM analysis is always strongly dependent on specimen prepara- 
tion. The ability to prepare ultrathin sections minimizes the unwanted background 
in core-loss spectra (Section 3.5) and mass thickness contributions to core-loss 
images (Section 2.6.5). For phosphorus L-edge measurements, the optimum speci- 
men thickness has been said to be 0.3 times the total inelastic mean free path (Wang 
et al., 1992), and for 100-keV primary electrons, this corresponds to about 100 nm 
of dry tissue or 60 nm of hydrated tissue. Rapid freezing techniques reduce the 
migration or loss of diffusible species, as needed for quantitative analysis. 

Leapman and Ornberg (1988) point out that carbon, nitrogen, and oxygen are the 
major constituents of biological specimens and their ratio (together with P and S) 
can be useful for identifying proteins and nucleotides (DNA, ATP, etc.). In fluoro- 
histidine, they measured N:O:F ratios within 10% of the nominal values, provided 
the radiation dose was kept below 2 C/cm². Fluorine is of potential importance as a 
label, for example, for identifying neurotransmitters in organelles (Section 5.4.4). 

Na, K, Mg, Cl, P, and S are typically present as dry mass fraction between 0.03 
and 0.6% (25-500 mmol/kg dry weight, equivalent to 5-100 mmol/kg wet wt., 
assuming 80% water content). Although these elements can be analyzed by EDX 
spectroscopy (Shuman et al., 1976; Fiori et al., 1988), mass loss and specimen drift 
limit the spatial resolution. In the case of EELS, higher sensitivity for S, P, Cl, and 
Fe is obtainable by choosing L-edges, with their higher scattering cross sections. 
The L-edges of sodium and magnesium lie too low in energy while that of potassium 
overlaps strongly with the carbon K-edge, so these three elements are more 
easily detected by EDX methods (Leapman and Ornberg, 1988). 

Calcium is present in high concentrations (~10%) in mineralizing bone but oth- 
erwise at the millimolar level. At this concentration, a 50-nm-diameter region in 
a 50-nm-thick specimen contains only about 50 Ca atoms, so measuring small 
changes in concentration requires very high sensitivity (Shuman and Somlyo, 1987; 
Leapman et al., 1993b). MLS processing and component analysis (Section 4.5.4) 
are likely to be useful tools. 
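
The atom count quoted above follows from the analyzed volume and the concentration. A minimal check, assuming a cylindrical analyzed volume and treating the millimolar level as 1 mmol per litre of specimen (the function name is illustrative):

```python
import math

AVOGADRO = 6.02214076e23  # atoms per mole


def atoms_in_cylinder(conc_mmol_per_litre, diameter_nm, thickness_nm):
    """Number of atoms of a species in a cylindrical analyzed volume."""
    volume_nm3 = math.pi * (diameter_nm / 2.0) ** 2 * thickness_nm
    volume_litre = volume_nm3 * 1.0e-24  # 1 nm^3 = 1e-24 litre
    return conc_mmol_per_litre * 1.0e-3 * AVOGADRO * volume_litre


ca_atoms = atoms_in_cylinder(1.0, 50.0, 50.0)
```

For a 50-nm-diameter, 50-nm-thick region at 1 mmol/l this gives about 59 atoms, consistent with the order of magnitude stated in the text.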

EFTEM elemental mapping of phosphorus, sulfur, and calcium was used by 
Ottensmeyer and colleagues to show the structure of chromatin nucleosomes 
and mineralizing cartilage (Bazett-Jones and Ottensmeyer, 1981; Arsenault and 
Ottensmeyer, 1983; Ottensmeyer, 1984). Very thin specimens ensured low plural 
scattering and mass thickness contributions to the image, but most of these ele- 
mental maps were obtained simply by subtracting a scaled pre-edge image from 
the post-edge image. With digital processing, pre-edge modeling can be carried out 
at each image point, allowing more accurate background subtraction; see Sections 
2.6.5 and 5.3.6. Leapman et al. (2004) have used tomographic energy-filtered imag- 
ing to measure the three-dimensional distribution of phosphorus within cells, down 
to about 0.5% concentration. A fairly high electron dose (100 C/cm²) was required 
to record the tilt series, but a resolution below 20 nm was achieved. 
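
Pre-edge modeling at each image point can be sketched with a three-window scheme. The example below assumes a power-law background of the form A·E^(−r), fitted at every pixel from two pre-edge images and extrapolated to the post-edge energy; the power-law form and the function names are assumptions made for illustration, not a description of the procedures used in the cited studies.

```python
import numpy as np


def power_law_background(pre1, pre2, e_pre1, e_pre2, e_post):
    """Extrapolate A * E**(-r) through two pre-edge images to the post-edge energy.

    pre1 and pre2 are images (arrays of equal shape) recorded at energy losses
    e_pre1 < e_pre2; A and r are solved independently at every pixel.
    """
    r = np.log(pre1 / pre2) / np.log(e_pre2 / e_pre1)
    a = pre1 * e_pre1 ** r
    return a * e_post ** (-r)


def elemental_map(pre1, pre2, post, e_pre1, e_pre2, e_post):
    """Post-edge image minus the pixel-by-pixel extrapolated background."""
    return post - power_law_background(pre1, pre2, e_pre1, e_pre2, e_post)
```

Unlike subtraction of a single scaled pre-edge image, this allows the background exponent to vary across the field of view, for example with local thickness.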
Spectrum imaging offers the possibility of extensive data manipulation after 
spectrum acquisition. For example, it allows segmentation to be used for measuring 
small concentrations of elements in particular organelles. Regions of similar com- 
position can be recognized by examining the K-edges of major constituents (C, N, 
O) and spectra from these regions can be summed to provide adequate statistics for 
measuring average trace element concentrations. Leapman et al. (1993b) used this 
technique to measure calcium concentrations (50-100 ppm) in mitochondria and 
endoplasmic reticulum (see Fig. 5.67) with a precision of better than 20%. Further 
discussion of the quantitative procedures involved is given in Aronova et al. (2009). 
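
The segmentation step can be illustrated with a simple threshold on a major-element map. This is only a sketch: the array layout (rows, columns, energy channels), the thresholding criterion, and the function name are assumptions, and real segmentation would typically use more than one element.

```python
import numpy as np


def segment_and_sum(spectrum_image, major_map, low, high):
    """Sum the spectra of all pixels whose major-element signal lies in [low, high).

    spectrum_image has shape (ny, nx, n_channels); major_map has shape (ny, nx).
    Returns the summed spectrum and the number of pixels that contributed.
    """
    mask = (major_map >= low) & (major_map < high)
    return spectrum_image[mask].sum(axis=0), int(mask.sum())
```

Summing N similar pixels increases the counts in each channel roughly N-fold, so for Poisson-limited data the relative statistical noise falls approximately as 1/√N.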

The low-loss spectra of biologically important substances exhibit a broad peak 
around 23 eV, whereas ice shows a peak around 20 eV and a sharp rise around 9 eV, 
probably due to excitation across the bandgap; see Fig. 5.68a. Sun et al. (1995) 
exploited these differences in fine structure to measure the water content within 
cells, with a precision of around 2% and a spatial resolution of 80 nm. Their pro- 
cedure involved MLS fitting of spectrum image data (6-30 eV region) to standard 
spectra from ice and protein. They also produced maps of water content, showing 
pronounced differences between mitochondria, cytoplasm, red blood cells, plasma, 
Fig. 5.67 (a, b) Regions of endoplasmic reticulum in mouse cerebellar cortex, segmented on 
the basis of their nitrogen content. (c) Spectrum obtained by summing contributions from both 
segmented regions. (d) First-difference spectrum, showing a weak Ca L-edge. (e) MLS fit of the 
Ca L-edge data points to a CaCl2 reference spectrum. From Leapman et al. (1993b), copyright 
Elsevier 
Fig. 5.68 (a) Low-loss spectra of protein, DNA, lipid, sugar, and ice. (b) Water map of frozen 
hydrated liver tissue: L = lipid droplets (zero water content), M = mitochondria (average content 
57%), R = erythrocyte (65% water), P = plasma (91% water). From Sun et al. (1995), copyright 
Wiley-Blackwell 


and lipid components; see Fig. 5.68b. This same method has more recently been 
applied to mapping the water distribution in skin tissue (Yakovlev et al., 2010). 
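
The MLS fitting step can be sketched as an ordinary linear least-squares problem. The example below fits one spectrum as a combination of two reference spectra and converts the coefficients to a fraction; the function names, and the assumption that the references are normalized so that the coefficients are proportional to mass, are illustrative and not details taken from the cited work.

```python
import numpy as np


def mls_fit(spectrum, references):
    """Least-squares coefficients c such that spectrum ~ references @ c.

    references has one reference spectrum per column (for example ice, protein),
    sampled on the same energy channels as spectrum (for example 6-30 eV).
    """
    coeffs, _residuals, _rank, _singular = np.linalg.lstsq(references, spectrum, rcond=None)
    return coeffs


def first_component_fraction(coeffs):
    """Fraction contributed by the first reference, assuming mass-normalized references."""
    return float(coeffs[0] / coeffs.sum())
```

Applying the fit at every pixel of a spectrum image would give a map of the fitted fraction, analogous to the water maps described above.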


5.7.5 Radiation Damage and Hole Drilling 


As discussed in Section 5.5.4, radiation damage provides a basic physical limit to the 
spatial resolution of electron-beam analysis, so there is considerable interest in min- 
imizing this damage. The basic damage mechanisms are summarized in Table 5.3, 
together with some ways of reducing their effect (Egerton et al., 2004). 

As discussed in Chapter 3, electrons undergo both elastic and inelastic scattering 
in a TEM specimen. Elastic scattering below 100 mrad, used to form diffraction 
patterns and bright-field images, involves negligible energy transfer (<0.1 eV) and 
no damage to the specimen. However, electrons scattered through larger angles 
can transfer several electron volts of energy and cause displacement (or knock-on) 
damage if the incident energy exceeds some threshold value, as discussed in 


Table 5.3 Mechanisms of radiation damage 

Mechanism                         Possible antidotes 
Knock-on displacement             Reduce E0 below threshold, surface coating 
Electron-beam heating             Reduce beam current, cool the specimen 
Charging (SE production)          Reduce beam current (possible threshold) 
Radiolysis (ionization damage)    Cool the specimen, surface coating 
Section 3.1.6. High-angle scattering is a rare event, so knock-on damage is important 
only at high electron doses (>1000 C/cm² typically) and is noticeable only 
in conducting materials (particularly metals), where the high density of free electrons 
prevents damage by radiolysis. The consequences are permanent displacement 
of atoms within a crystal or at grain boundaries (Bouchet and Colliex, 2003) and 
removal of atoms from the specimen surface (electron-induced sputtering). In the 
latter case, a thin carbon surface coating has been shown to be effective in protecting 
the specimen for a limited period (Muller and Silcox, 1995b). 

Inelastic scattering involves significant energy transfer to the specimen, on the 
average several tens of electron volts per scattering event. Most of this energy ends 
up as thermal vibration (heat) but some goes into secondary electron (SE) produc- 
tion, giving rise to electrostatic charging of insulating specimens. In addition, the 
electron transitions involved in inelastic scattering can result in ionization damage 
(radiolysis). 

The average energy ⟨E⟩ deposited in a specimen, per incident electron, can be 
evaluated from its energy-loss spectrum: 


$$\langle E \rangle = \int E\,J(E)\,dE \Big/ \int J(E)\,dE \tag{5.36}$$


where the integration is over the entire spectrum including the zero-loss peak 
(Egerton, 1982b). Because Eq. (5.36) includes plural scattering contributions to 
J(E), ⟨E⟩ increases more than linearly with specimen thickness. Assuming that 
only a small fraction of the energy transfer goes into SE production and radiolysis, 
the temperature rise in the beam can be estimated by equating the rates of energy 
deposition and heat loss, giving 


I(E)(t/Ay) = 4x t(T — To)/[0.58 + 2 In(2Ro/d)] + (1/2)d720(T* — Th) (5.37) 


where I is the beam current (in A), ⟨E⟩ is in eV, and λi is the total inelastic mean
free path. Radial heat conduction is assumed, κ being the thermal conductivity of
the specimen, R0 the distance between the beam and a thermal sink (e.g., grid bars,
assumed to be at the ambient temperature T0); d is the electron beam diameter.

Heat loss by radiation from the upper and lower surfaces of the specimen is
represented by the last term in Eq. (5.37), ε being the emissivity of the specimen and
σ Stefan’s constant. However, in almost all cases this term is negligible compared to
the conduction term (Reimer and Kohl, 2008). The temperature rise ΔT = T − T0 is
then approximately independent of specimen thickness and depends mainly on the
beam current (not current density) and only logarithmically on the beam diameter:
for I = 5 nA, ΔT increases from 0.5° to 1.5°C as d decreases from 1 μm to 0.5 nm
(Egerton et al., 2004).
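
As a rough numerical illustration of Eq. (5.37), the sketch below neglects the radiation term and solves for ΔT = T − T0. The function name and all input values (⟨E⟩ = 40 eV, λi = 100 nm, κ = 1.6 W/(m K), R0 = 30 μm) are illustrative assumptions, not values taken from the text, so the output indicates only the order of magnitude and the weak dependence on beam diameter.

```python
import math

def temperature_rise(beam_current_A, mean_loss_eV, mfp_m, kappa_W_mK,
                     sink_distance_m, beam_diameter_m):
    """Solve Eq. (5.37) for T - T0 with the radiation term neglected.

    I*<E> is numerically a power in watts when I is in amperes and <E> in eV.
    The specimen thickness t cancels between the two sides of the equation.
    """
    power_per_length = beam_current_A * mean_loss_eV / mfp_m      # W/m
    geometry = 0.58 + 2.0 * math.log(2.0 * sink_distance_m / beam_diameter_m)
    return power_per_length * geometry / (4.0 * math.pi * kappa_W_mK)

# Hypothetical specimen and beam parameters (assumptions for illustration only)
params = dict(beam_current_A=5e-9, mean_loss_eV=40.0, mfp_m=100e-9,
              kappa_W_mK=1.6, sink_distance_m=30e-6)

dT_wide = temperature_rise(beam_diameter_m=1e-6, **params)      # 1 um beam
dT_narrow = temperature_rise(beam_diameter_m=0.5e-9, **params)  # 0.5 nm probe
print(f"dT (d = 1 um)   = {dT_wide:.2f} K")
print(f"dT (d = 0.5 nm) = {dT_narrow:.2f} K")
```

Reducing the beam diameter by more than three orders of magnitude changes the estimate by less than a factor of three, which reflects the logarithmic dependence on d.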

The temperature rise is usually negligible for small probes, whereas for beam cur- 
rents of many nanoamperes, it can be tens or hundreds of degrees (Reimer and Kohl, 
2008). In beam-sensitive specimens such as polymers, the result can be melting or 
warping of the specimen, especially when accompanied by electrostatic charging. 




Reducing the beam current is helpful, even if it results in a longer exposure time 
to acquire the data. Because radiolysis occurs more rapidly at higher temperature, 
beam heating may account for the higher radiation sensitivity of polymer films at 
higher dose rate (current density) observed by Payne and Beamson (1993). 

Radiolysis occurs because the electron excitation is not necessarily a reversible 
process: when an atom or molecule returns to its ground state, the chemical bonds 
with neighboring atoms may reconfigure, resulting in a permanent structural change. 
In crystalline specimens, structural disorder is seen as a disappearance of lattice 
fringes and a gradual fading of the spot diffraction pattern (Glaeser, 1975; Zeitler, 
1982). The disruption of chemical bonding can be seen more directly as a disappearance
of the fine structure in optical-absorption and energy-loss spectra (Reimer,
1975; Isaacson, 1977). Radiolysis may also result in the removal of atoms from the
irradiated area, known as mass loss. This process is of concern in elemental analysis 
by EELS or EDX spectroscopy because some elements are removed more rapidly 
than others, resulting in a change in chemical composition. 


5.7.5.1 Damage Measurements on Organic Specimens 


Although radiation damage is detrimental to electron-beam measurements, EELS 
has proven useful for examining the sensitivities of different types of specimen, the 
damage mechanisms involved, and ways of reducing the damage. Core-loss spec- 
troscopy has been used to monitor the loss of particular elements from organic 
specimens, while low-loss or core-loss fine structure has been used as a measure 
of structural order. 

The dose required for a single measurement is reduced if the spectrum is collected
from as large an area of specimen as possible. In TEM image mode, this
implies a low magnification and a large spectrometer entrance aperture. If a spectrum
is recorded at a time t after the start of irradiation, the accumulated dose is
D = It/A, where I is the beam current and A the cross-sectional area of the beam
at the specimen. The remaining amount (N atoms/area) of a particular element is
calculated from its ionization edge, making use of Eq. (4.65). If log(N) is then plotted
against D, the initial slope of the data gives the characteristic or critical dose
Dc (the dose that would cause N to fall to 1/e of its initial value, if the kinetics
remained strictly exponential). The value of Dc is an inverse measure of the radiation
sensitivity of the specimen.
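
The procedure described above can be sketched as follows: compute the accumulated dose D = It/A for each spectrum, then fit a straight line to ln(N) against D and take Dc as the negative reciprocal of the slope. The function names and the synthetic data are hypothetical; with real measurements only the initial, approximately exponential part of the decay should be passed to the fit.

```python
import math

def accumulated_dose(beam_current_A, time_s, area_cm2):
    """Dose D = I*t/A in C/cm^2."""
    return beam_current_A * time_s / area_cm2

def critical_dose(doses, amounts):
    """Least-squares slope of ln(N) versus D; returns Dc = -1/slope."""
    logs = [math.log(n) for n in amounts]
    count = len(doses)
    mean_d = sum(doses) / count
    mean_l = sum(logs) / count
    sxy = sum((d - mean_d) * (l - mean_l) for d, l in zip(doses, logs))
    sxx = sum((d - mean_d) ** 2 for d in doses)
    return -sxx / sxy

# Synthetic, strictly exponential decay with an assumed Dc of 0.07 C/cm^2
true_dc = 0.07
doses = [0.00, 0.01, 0.02, 0.03, 0.04]
amounts = [100.0 * math.exp(-d / true_dc) for d in doses]
fitted_dc = critical_dose(doses, amounts)

# Example dose: 1 nA for 10 s spread over 1 um^2 (1e-8 cm^2)
example_dose = accumulated_dose(1e-9, 10.0, 1e-8)
print(f"fitted Dc = {fitted_dc:.3f} C/cm^2, example dose = {example_dose:.2f} C/cm^2")
```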

Measurements on organic materials have shown that mass loss depends on the
accumulated dose and not on the dose rate (i.e., Dc is independent of current density).
Table 5.4 lists Dc for selected organic compounds exposed to 100-keV incident
electrons. Values for other incident energies can be estimated by assuming Dc to be
proportional to the effective incident energy: T = m0v²/2 (Isaacson, 1977). Not
surprisingly, Dc is low for compounds containing unstable groups such as nitrates.
Aromatic compounds are generally more stable than aliphatic ones, and it has been
proposed that damage to aromatics requires K-shell ionization (Howie et al., 1985).
Replacement of hydrogen by halogen atoms (as in chlorinated phthalocyanine) further
reduces the radiation sensitivity, due to the increased steric hindrance (cage
Table 5.4 Characteristic dose for removal of specified elements from organic compounds by
100-keV electrons (Egerton, 1982b; Ciliax et al., 1993)

Material                            Element removed   Dc (C/cm²) 300 K   Dc (C/cm²) 100 K
Nitrocellulose (collodion)          C                 0.07               0.4
                                    N                 0.002              0.3
                                    O                 0.007              0.6
Poly(methyl methacrylate) (PMMA)    C                 0.6                1
                                    O                 0.07               0.6
Copper phthalocyanine               N                 0.8
Cl16Cu-phthalocyanine               Cl                4                  >10
Perfluorotetracosane                F                 0.2                0.2
Amidinotetrafluorostilbene          F                 0.8                >10+

effect) from surrounding atoms. Fluorine attached directly to an aromatic ring can 
be remarkably stable, especially at low temperatures (Ciliax et al., 1993). 
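
The energy scaling mentioned before Table 5.4 (Dc proportional to T = m0v²/2) can be illustrated with a short calculation. This is a minimal sketch: the function names are hypothetical, and m0c² is taken as 511 keV.

```python
M0C2_KEV = 511.0  # electron rest energy in keV

def effective_energy_keV(e0_keV):
    """Effective incident energy T = m0*v^2/2 for kinetic energy E0."""
    gamma = 1.0 + e0_keV / M0C2_KEV
    beta_sq = 1.0 - 1.0 / gamma**2          # (v/c)^2
    return 0.5 * M0C2_KEV * beta_sq

def scale_critical_dose(dc_at_100keV, e0_keV):
    """Estimate Dc at incident energy E0 from its 100-keV value, assuming Dc ∝ T."""
    return dc_at_100keV * effective_energy_keV(e0_keV) / effective_energy_keV(100.0)

t_100 = effective_energy_keV(100.0)
ratio_200 = scale_critical_dose(1.0, 200.0)
print(f"T(100 keV) = {t_100:.1f} keV, Dc(200 keV)/Dc(100 keV) = {ratio_200:.2f}")
```

Under this assumption, doubling the accelerating voltage from 100 to 200 kV raises the estimated characteristic dose by a factor of about 1.6 rather than 2, because T increases more slowly than E0 at relativistic velocities.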

As seen in Table 5.4, cooling an organic specimen to 100 K reduces the rate 
of mass loss, sometimes by a large factor. Cryogenic operation may not change 
the number of broken bonds but it prevents atoms from leaving the irradiated area 
by reducing their diffusion rate. EELS measurements confirm that gaseous atoms 
leave the irradiated area when the specimen returns to room temperature (Egerton, 
1980c). Lamvik et al. (1989) found that mass loss in collodion is further reduced at 
a temperature of 10 K. 

An alternative way of reducing mass loss is to coat the specimen on both sides 
with a thin (~10 nm) film of carbon or a metal. Perhaps because each surface film 
acts as a diffusion barrier, mass loss is typically reduced by a factor of 2–6 (Egerton
et al., 1987). Carbon contamination films produced in the electron beam are believed 
to have a similar protective effect. According to Fryer and Holland (1984), encapsu- 
lation also helps to preserve crystallinity, perhaps by aiding recombination processes 
or acting as an electron source. 

Radiation effects can also be monitored from the low-loss spectrum, with a
lower dose required for measurement. From the plasmon peak intensity, Egerton and
Rossouw (1976) measured the rate of hydrocarbon contamination as a function of
specimen temperature and found that it became negative (indicating etching by oxygen
or water vapor) below −50°C. By measuring a shift in the main plasmon energy
toward that of amorphous carbon, Ditchfield et al. (1973) found that polyethylene
loses a significant fraction of its hydrogen at doses as low as 107? C/cm². In addition,
they observed the creation of double bonds (indicating cross-linking) from
the appearance of a π-excitation peak around 6 eV; in the case of polystyrene, an
initially visible π peak decreased upon irradiation, showing that double bonds were
being broken. Fine structure below 10 eV in the spectrum of nucleic acid bases gradually
disappears during the irradiation (Isaacson, 1972a). Doses that cause these
bonding changes are usually intermediate between those needed to destroy the
diffraction pattern (of a crystalline specimen) and the larger values associated with
mass loss (Isaacson, 1977).
Core-loss fine structure provides another indication of bonding, with the advantage
that different ionization edges can be recorded to determine the atomic site
at which damage occurs. In the case of Ge-O-phthalocyanine, Kurata et al. (1992)
found a decrease in the π* threshold peak to be more rapid at the nitrogen edge than
at the carbon edge, suggesting chemical reaction between N and adjacent H atoms
released during irradiation. Conversely, the emergence of a π* peak at carbon and/or
nitrogen edges has been observed during electron irradiation of fluorinated compounds,
providing evidence for the formation of double bonds and aromatization of
ring structures (Ciliax et al., 1993).


5.7.5.2 Damage Measurements on Inorganic Materials 


TEM-EELS has also been used to investigate electron-beam damage to inorganic 
materials. Hydrides are among the most radiation sensitive: a dose of 0.1 C/cm² converts
NaH to metallic sodium inside the electron microscope (Herley et al., 1987), as
seen from the emergence of crystalline needles whose plasmon-loss spectrum con- 
tains sharp peaks. Metal halides are also rather beam sensitive: an efficient excitonic 
mechanism results in the creation of halogen vacancies (F-centers) and interstitials 
(H-centers), which may diffuse to the surface, resulting in the ejection of halogen 
atoms (Hobbs, 1984). Halogen loss apparently depends on the dose rate as well 
as the accumulated dose (Egerton, 1980f). Other radiolytic processes in inorganic 
materials include inner-shell ionization followed by an interatomic Auger decay 
(Knotek, 1984). 

Thomas (1982, 1984) used K-shell spectroscopy to monitor the effect of a field-emission
STEM probe on compounds such as Cr3C2, TiC0.94, Cr2N, and Fe2O3. The
dose required for the removal of 50% of the nonmetallic element was in the range
10°-10° C/cm² and did not change when specimens were cooled to 143 K, suggesting
a knock-on or sputtering process as the damage mechanism. Hole drilling in
silicon nitride was judged to be due to sputtering because the process occurred only 
above a threshold incident energy (120 keV) and because the specimen thickness 
decreased linearly (rather than exponentially) with time (Howitt et al., 2008). 

A sputtering rate can be estimated from the displacement cross section σd
of the appropriate element, calculated as a Rutherford or Mott cross section
(Section 3.1.6), but only if the surface-displacement energy Ed is known (Oen,
1973; Bradley, 1988). The latter is usually taken as the sublimation energy Es, but
Ed = (5/3)Es appears to give better agreement with experimental data for metals
(Egerton et al., 2010). The sputtering rate in monolayers per second is then (J/e)σd,
where J is the incident electron current density (e.g., in A/cm²) and e is the electron
charge.
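
The rate expression (J/e)σd is simple to evaluate. In the sketch below, both the current density (10⁶ A/cm²) and the displacement cross section (10⁻²² cm², i.e., 100 barn) are hypothetical inputs chosen only to show the arithmetic; a real estimate requires a cross section calculated for the element and incident energy of interest.

```python
ELECTRON_CHARGE_C = 1.602176634e-19

def sputtering_rate_monolayers_per_s(current_density_A_cm2, sigma_d_cm2):
    """Sputtering rate (J/e)*sigma_d in monolayers per second."""
    electrons_per_cm2_per_s = current_density_A_cm2 / ELECTRON_CHARGE_C
    return electrons_per_cm2_per_s * sigma_d_cm2

# Assumed (illustrative) inputs: J = 1e6 A/cm^2, sigma_d = 1e-22 cm^2
rate = sputtering_rate_monolayers_per_s(1.0e6, 1.0e-22)
print(f"sputtering rate = {rate:.0f} monolayers/s")
```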

Because sputtering and bulk displacement processes are absent below some 
threshold incident energy, considerable interest has been devoted to designing TEM- 
EELS systems that work at lower accelerating voltages, while still achieving good 
spatial and energy resolution. For example, a TEM fitted with a delta-type aberra- 
tion corrector has achieved atomic resolution and 0.3-eV energy resolution at 30 kV 
(Sasaki et al., 2010). 
5.7.5.3 Electron-Beam Lithography and Hole Drilling


High-brightness electron sources and aberration-corrected lenses allow the production
of nanometer-scale electron probes with very high current densities (>10°
A/cm²) that can be quickly damaging to a TEM specimen. The implications for
nanolithography have also been explored, with high-density information storage 
as one of the stated applications, and EELS has played an important part in these 
investigations. 

Muray et al. (1985) used a 100-kV field-emission STEM (vacuum of 10~° torr
in the specimen chamber) with a serial-recording spectrometer to investigate beam
damage in vacuum-evaporated films of metal halides. After a dose of 1 C/cm², a
50-nm film of NaCl is largely converted to sodium, as shown by the appearance
of sharp surface (3.8 eV) and volume (5.7 eV) plasmon peaks. A dose of 100
C/cm² removes the sodium, creating a 2-nm-diameter hole. Similar behavior was
observed for LiF but hole formation required only 10~* C/cm². In the case of MgF2,
magnesium was formed (bulk plasmon peak at 10.2 eV) for D ≈ 1 C/cm² but not
removed by prolonged irradiation. In CaF2, bubbles of molecular fluorine have been
detected from the appearance of a sharp peak at 682 eV (K-edge threshold) but only
with fine-grained films evaporated onto a low-temperature substrate (Zanetti et al.,
1994).

A finely focused electron beam can also create nanometer-scale holes in metal- 
lic oxides. Sometimes hole drilling is seen only above a threshold current density, 
typically of the order of 1000 A/cm² (Salisbury et al., 1984). The existence of this
threshold may indicate that a radial electric field around the beam axis (positive 
potential at the center because of the high secondary electron yield of insulators) 
must be established, of sufficient strength to remove cations in a Coulomb-explosion 
process (Humphreys et al., 1990; Cazaux, 1995). The fact that the current density
threshold for alumina was reduced by cooling the sample to 85 K (enabling
hole drilling to be performed with a tungsten filament electron source) suggests 
inward diffusion of metal, less effective at low temperatures (Devenish et al., 
1989). 

Berger et al. (1987) used their field-emission STEM to create holes in amorphous
alumina. During drilling, the low-loss spectrum exhibited a peak at about
9 eV (Fig. 5.69a), the oxygen K-edge spectrum developed a sharp threshold resonance
(Fig. 5.69b), and the O/Al ratio, measured from areas under the O K- and Al
L-edges, increased from 1.5 to 7 or more, all consistent with the creation of a bubble
containing molecular oxygen. The bubble burst after an average time of 40 s, leaving
a 5-nm-diameter hole. Hole drilling in sodium β-alumina proceeded somewhat
differently. A sharp peak appeared, with a shoulder at 9 eV and maximum at 15 eV
(Fig. 5.69c), suggesting surface and bulk modes in small Al spheres. At the same
time, the Al L-edge shifted 2 eV lower in energy and became more rounded in shape
(Fig. 5.69d), consistent with the formation of aluminum metal from an insulating
oxide. The oxygen K-edge gradually weakened, the O/Al ratio decreasing from 1.5
to 0.6 typically. Berger et al. (1987) suggested that oxygen is lost from both surfaces,
forming surface indentations that grow inward, leaving behind Al particles coating
Fig. 5.69 (a) Low-loss and (b) oxygen K-loss spectra of amorphous alumina, before and during
hole drilling. (c) Low-loss and (d) Al L-loss spectra of Na β-Al2O3 before and during drilling.
(e) Low-loss spectrum of a hole plugged with aluminum. From Berger et al. (1987), copyright
Taylor and Francis Ltd


the inside walls. Although in most cases a hole is formed after 30 s, the zero-loss 
intensity remained well below the incident beam current, indicating scattering from 
Al within the incident beam. Occasionally, the hole became filled with a plug of 
continuous aluminum, as evidenced by bulk plasmon peaks in the low-loss spec- 
trum (Fig. 5.69e). This metallization is similar to the normal irradiation behavior of 
halides such as MgF2.

Hole drilling has been demonstrated in many other oxides, with doses mainly in 
the range 10⁴–10° C/cm² (Hollenbeck and Buchanan, 1990). In the case of crystalline
MgO, square holes are formed from growth of an indentation on the electron
exit surface (Turner et al., 1990). Hole formation in metallic films such as aluminum 
(Bullough, 1997) and metallic alloys (Muller and Silcox, 1995b) can be attributed 
to electron-induced sputtering. 

High-angle elastic scattering, which can lead to the displacement of atoms within 
a crystal or sputtering from the surface of a specimen, is discussed in Section 3.1.6. 
Displacement occurs only for an incident electron energy above a threshold value,
Table 5.5 Sublimation energy Esub and threshold energy E0^th for electron-induced sputtering of
elemental solids. The sublimation energy of carbon may be as low as 5 eV in an organic compound
or as high as 11 eV in diamond

Element   Atomic wt. A   Esub (eV)   E0^th (keV)      E0^th (keV)
symbol                               for Ed = Esub    for Ed = (5/3)Esub
Li        6.94           1.66        5.2              8.7
C         12.0           ≈8          ~42              ~68
Al        27.0           3.42        40               65
Si        28.1           4.63        56               91
Ti        47.9           4.86        97               154
V         50.9           5.31        111              175
Cr        52.0           4.10        89               142
Mn        53.9           2.93        68               109
Fe        55.9           4.29        100              158
Co        58.9           4.47        109              171
Ni        58.7           4.52        109              172
Cu        63.6           3.49        93               147
Zn        65.4           1.35        39               63
Ge        72.6           3.86        115              181
Sr        87.6           1.72        65               104
Zr        91.2           6.26        215              328
Nb        92.9           7.50        254              385
Mo        95.9           6.83        242              366
Ag        107.9          2.95        129              202
Ta        180.9          8.12        461              673
W         183.9          8.92        501              728
Pt        195.1          5.85        379              560
Au        197.0          3.80        270              407

From Egerton et al. (2010), copyright Elsevier


which in the case of electron-induced sputtering of elements can be calculated
from the sublimation energy Esub. As seen in Table 5.5, these threshold energies
are mostly below 200 keV, even if the surface-displacement energy Ed is taken as
(5/3)Esub (Egerton et al., 2010).
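
The threshold energies in Table 5.5 can be approximately reproduced from the sublimation energy. The sketch below assumes the standard relativistic expression for the maximum energy transfer in a head-on elastic collision, Emax = 2E0(E0 + 2m0c²)/(Mc²), and sets Emax = Ed; whether this is exactly the form used for the table is an assumption, and small differences from the tabulated values are to be expected. Function names are hypothetical.

```python
import math

M0C2_EV = 510998.95      # electron rest energy (eV)
AMU_EV = 931.494e6       # atomic mass unit expressed as an energy (eV)

def max_energy_transfer_eV(e0_eV, atomic_weight):
    """Maximum energy transferred to a nucleus in a head-on elastic collision."""
    return 2.0 * e0_eV * (e0_eV + 2.0 * M0C2_EV) / (atomic_weight * AMU_EV)

def threshold_energy_keV(ed_eV, atomic_weight):
    """Incident energy at which Emax equals the displacement energy Ed.

    Solves E0^2 + 2*m0c2*E0 - Ed*M*c^2/2 = 0 for the positive root.
    """
    c = 0.5 * ed_eV * atomic_weight * AMU_EV
    e0_eV = -M0C2_EV + math.sqrt(M0C2_EV**2 + c)
    return e0_eV / 1000.0

# (symbol, atomic weight, Esub in eV) taken from Table 5.5
for symbol, weight, e_sub in [("Li", 6.94, 1.66), ("Al", 27.0, 3.42),
                              ("Cu", 63.6, 3.49), ("Au", 197.0, 3.80)]:
    low = threshold_energy_keV(e_sub, weight)
    high = threshold_energy_keV(5.0 * e_sub / 3.0, weight)
    print(f"{symbol}: {low:.1f} keV (Ed = Esub), {high:.1f} keV (Ed = 5/3 Esub)")
```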

The sputtering rate can be estimated as Jeσd/e monolayers/s, where Je is the
current density in A cm⁻² and σd is a surface-displacement cross section in cm². For
a field-emission source and aberration-corrected probe, Je can exceed 10° A/cm²,
giving sputtering rates of many nanometers per second. The predicted sputtering
rate is typically a few times lower if cross sections are based on a planar escape
potential (Section 3.1.6).

Alternatively, low-loss EELS can be used to measure the decrease in thickness 
of a specimen with time, or core-loss measurements can reveal the loss of a specific 
element. For example, Fig. 5.70 shows that aluminum is sputtered predominantly 
from the bottom surface of a C/Al bilayer film; with carbon on the bottom surface,
loss of Al K-signal is delayed until the carbon is removed by sputtering. Because 
sputtering is a slow process, a relatively high beam current is helpful for accurate 
Fig. 5.70 Decrease in Al K-loss signal with irradiation time for a C/Al bilayer film, before and
after inversion in the TEM. Measurements were made in a TEM with a LaB6 thermionic electron
source. Reproduced from Egerton et al. (2006b), with permission from Cambridge University
Press


measurement of the sputtering rate, favoring the use of a thermionic (W-filament or 
LaB6) electron source. A field-emission source can deliver a higher current density,
but with lower probe diameter and current. If the probe diameter is comparable to 
or less than the depth of the crater formed by sputtering, some sputtered material is 
re-deposited on the sides of the hole, reducing the measured thinning rate. 


Appendix A 
Bethe Theory for High Incident Energies 
and Anisotropic Materials 


Even for 100-keV incident electrons, it is necessary to use relativistic kinematics 
to calculate inelastic cross sections, as in Section 3.6.2. Above 200 keV, however, 
an additional effect starts to become important, representing the fact that the elec- 
trostatic interaction is “retarded” due to the finite speed of light. At high incident 
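energies and for isotropic materials, Eq. (3.26) should be replaced by (Møller, 1932;
Perez et al., 1977)

$$\frac{d^2\sigma}{d\Omega\,dE} = 4\gamma^2 a_0^2 R^2 \left(\frac{k_1}{k_0}\right)\left[\frac{1}{Q^2} - \frac{2\gamma-1}{\gamma^2 Q\,(E_0-Q)} + \frac{1}{(E_0-Q)^2} + \frac{1}{(E_0+m_0c^2)^2}\right] |\eta(\mathbf{q},E)|^2 \tag{A.1}$$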
where γ = 1/(1 − v²/c²)^(1/2), v is the incident electron velocity, a0 = 52.92 pm is
the Bohr radius, R = 13.6 eV is the Rydberg energy, and m0c² = 511 keV, the rest
energy of an electron. For most collisions, the last three terms within the brackets of
Eq. (A.1) can be neglected and the ratio (k1/k0) of the fast-electron wavenumbers
(after and before scattering) can be taken as unity. The quantity Q has dimensions
of energy and is defined by

$$Q = \frac{\hbar^2 q^2}{2m_0} - \frac{E^2}{2m_0c^2} = R(qa_0)^2 - \frac{E^2}{2m_0c^2} \tag{A.2}$$

where q is the scattering vector and E represents energy loss. The E²/2m0c² term in
Eq. (A.2) can become significant at small scattering angles.
In Eq. (A.1), |η(q, E)|² is an energy-differential relativistic form factor, equal
to the nonrelativistic form factor |ε(q, E)|² in the case of high-angle collisions but
given for qa0 << 1 by (Inokuti, 1971):


$$|\eta(\mathbf{q},E)|^2 = \frac{1}{E}\left(\frac{df}{dE}\right)\left[Q - \frac{E^2}{2\gamma^2 m_0c^2}\right] \tag{A.3}$$


where df/dE is the energy-differential generalized oscillator strength employed in
Sections 3.2.2 and 3.6.1. 

According to Fano (1956), the differential cross section is a sum of two independent
terms (incoherent addition) and within the dipole region, θ << (E/E0)^(1/2), his
result can be written as

$$\frac{d^2\sigma}{d\Omega\,dE} \approx \frac{4a_0^2}{(E/R)(T/R)}\left(\frac{df}{dE}\right)\left[\frac{1}{\theta^2+\theta_E^2} + \frac{(v/c)^4\,\theta^2\theta_E^2}{(\theta^2+\theta_E^2)(\theta^2+\theta_E^2/\gamma^2)^2}\right] \tag{A.4}$$


where T = m0v²/2 and θE = E/(γm0v²) = E/(2γT) as previously. The first term
is identical to Eq. (3.29) and gives the Lorentzian angular distribution observed at
lower incident energies. It arises from Coulomb (electrostatic) interaction between
the incident and atomic electrons and involves forces parallel to the scattering
vector q.

The second term in Eq. (A.4) represents the exchange of virtual photons involving
forces perpendicular to q (transverse excitation). This term is zero at θ = 0
and negligible at large θ, but can be significant for small scattering angles. It
becomes more important as the incident energy increases: for E0 > 250 keV it
shifts the maximum in the angular distribution away from zero angle, as illustrated
in Fig. A.1. This angular dependence has been confirmed by energy-filtered diffraction
patterns of the K-shell excitation in graphite by 400-keV and 1-MeV electrons
(Kurata et al., 1997). The displaced maximum should not be confused with the Bethe
ridge (Sections 3.5 and 3.6.1), which occurs at higher scattering angles and only for
energy losses well above the binding energies of the electrons involved.
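
The shift of the maximum away from zero angle can be checked numerically. The sketch below evaluates the bracketed angular factor of Eq. (A.4), assuming the transverse term has the form (v/c)⁴θ²θE²/[(θ² + θE²)(θ² + θE²/γ²)²], which is the form consistent with Eqs. (A.6) and (A.11). Angles are expressed in units of θE, and the function name is hypothetical.

```python
M0C2_KEV = 511.0  # electron rest energy in keV

def angular_factor(theta, theta_E, e0_keV):
    """Longitudinal plus transverse angular terms of Eq. (A.4) (assumed form)."""
    gamma = 1.0 + e0_keV / M0C2_KEV
    beta_sq = 1.0 - 1.0 / gamma**2                      # (v/c)^2
    longitudinal = 1.0 / (theta**2 + theta_E**2)
    transverse = (beta_sq**2 * theta**2 * theta_E**2 /
                  ((theta**2 + theta_E**2) *
                   (theta**2 + theta_E**2 / gamma**2) ** 2))
    return longitudinal + transverse

theta_E = 1.0  # work in units of the characteristic angle
for e0 in (100.0, 400.0, 1000.0):
    on_axis = angular_factor(0.0, theta_E, e0)
    off_axis = angular_factor(0.34, theta_E, e0)
    print(f"E0 = {e0:6.0f} keV: J(0.34 thetaE)/J(0) = {off_axis / on_axis:.3f}")
```

A ratio above unity indicates that the intensity at a small non-zero angle exceeds the on-axis value, i.e., that the maximum has moved away from θ = 0.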
Fig. A.1 Differential cross section for K-shell scattering in aluminum, at an energy
loss just above the ionization edge, calculated for three values of incident-electron
energy using a hydrogenic expression for df/dE. Solid curves include the effect of
retardation; the dashed curves do not. From Egerton (1987), copyright Philips Electron
Optics




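Integration of Eq. (A.4) up to a collection angle β gives

$$\frac{d\sigma}{dE} = \frac{4\pi a_0^2}{(E/R)(T/R)}\,\frac{df}{dE}\left[\ln(1+\beta^2/\theta_E^2) + G(\beta,\gamma,\theta_E)\right] \tag{A.5}$$

where

$$G(\beta,\gamma,\theta_E) = 2\ln\gamma - \ln\left(\frac{\beta^2+\theta_E^2}{\beta^2+\theta_E^2/\gamma^2}\right) - \frac{v^2}{c^2}\left(\frac{\beta^2}{\beta^2+\theta_E^2/\gamma^2}\right) \tag{A.6}$$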

The retardation term G(β, γ, θE) exerts its largest effect at β ≈ θE and increases the
energy-loss intensity by 5% to 10% for E0 = 200 keV, and by larger amounts at
higher incident energy; see Fig. A.2. Under certain conditions, this increase in cross
section results in the emission of Cerenkov radiation (Section 3.3.4); when Q = 0,
E = pc, giving a resonant condition with photons. For β >> θE but still within the
dipole region, Eq. (A.5) simplifies to an alternative form given by Fano (1956).
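
The size of the retardation correction can be estimated by evaluating G directly. The sketch below assumes G has the form 2 ln γ − ln[(β² + θE²)/(β² + θE²/γ²)] − (v²/c²)β²/(β² + θE²/γ²) and expresses the result as a percentage of the nonrelativistic term ln(1 + β²/θE²); the function names are hypothetical.

```python
import math

M0C2_KEV = 511.0  # electron rest energy in keV

def retardation_G(beta, theta_E, gamma):
    """Retardation term G(beta, gamma, theta_E) of Eq. (A.6) (assumed form)."""
    v2_over_c2 = 1.0 - 1.0 / gamma**2
    x = beta**2
    a = theta_E**2
    b = a / gamma**2
    return (2.0 * math.log(gamma)
            - math.log((x + a) / (x + b))
            - v2_over_c2 * x / (x + b))

def percent_increase(beta_over_theta_E, e0_keV):
    """100*G/ln(1 + beta^2/theta_E^2), the quantity plotted in Fig. A.2."""
    gamma = 1.0 + e0_keV / M0C2_KEV
    g = retardation_G(beta_over_theta_E, 1.0, gamma)
    return 100.0 * g / math.log(1.0 + beta_over_theta_E**2)

pct_200 = percent_increase(1.0, 200.0)
print(f"Increase at beta = thetaE, E0 = 200 keV: {pct_200:.1f}%")
```

With these assumptions the correction at β = θE and E0 = 200 keV falls within the 5% to 10% range quoted above, and G tends to zero as β approaches zero, as expected for a term that vanishes on axis.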

For an ionization edge whose threshold energy is Ek, Eq. (A.5) can be integrated
over an energy range Δ that is small compared to Ek to give

$$\sigma_k(\beta,\Delta) = \frac{4\pi a_0^2}{(\langle E\rangle/R)(T/R)}\,f(\Delta)\left[\ln(1+\beta^2/\langle\theta_E\rangle^2) + G(\beta,\gamma,\langle\theta_E\rangle)\right] \tag{A.7}$$

where ⟨E⟩ and ⟨θE⟩ are average values of E and θE within the integration region.
Numerical evaluation shows that Eq. (A.7) is a better approximation if a geometric
(rather than arithmetic) mean is used, so that ⟨E⟩ = [Ek(Ek + Δ)]^(1/2) and ⟨θE⟩ =
⟨E⟩/2γT. This equation can be used to calculate cross sections for EELS elemental


Fig. A.2 Percentage increase in cross section (for four values of incident energy) as a result of
relativistic retardation, according to Eq. (A.7). Vertical axis: 100 G/ln(1 + β²/θE²); horizontal
axis: β/θE
analysis from tabulated values of the dipole oscillator strength f(Δ), as in the SIGPAR
program (Appendix B). 

If the integration is carried out over all energy loss, the result is the Bethe
asymptotic formula for the total ionization cross section for an inner shell, used
in calculating x-ray production:

$$\sigma_k = \frac{4\pi a_0^2 N_k b_k}{(E_k/R)(T/R)}\left[\ln\left(\frac{c_k T}{E_k}\right) + 2\ln\gamma - \frac{v^2}{c^2}\right] \tag{A.8}$$

where Nk is the number of electrons in the shell k (2, 8, and 18 for K, L, and M
shells); bk and ck are parameters that can be determined on the basis of experimental
data (Zaluzec, 1984). As seen from Eq. (A.8), a Fano plot of v²σk against
ln[v²/(c² − v²)] − v²/c² should yield a straight line even at megaelectron volt energies
(Inokuti, 1971). The last two terms in Eq. (A.8) cause σk to pass through a
minimum and exhibit a relativistic rise when the incident energy exceeds about
1 MeV.


A.1 Anisotropic Specimens 


The equations above are believed to be relativistically correct for a specimen that is 
isotropic in its physical properties, such as an amorphous material or a cubic crys- 
tal. Anisotropic materials are more complicated because their dielectric properties 
depend on the direction of q, and therefore on the angle Γ between the incident
beam and the z-axis (Fig. 3.60a). 

We illustrate this situation by considering the case of a uniaxial material such
as graphite or hexagonal boron nitride (h-BN), in which the atoms lie in layers
containing the x- and y-axes, separated by a larger interatomic distance (and weaker
bonding) in the perpendicular z-direction. In the xy-plane there are three hybridized
σ-bonding orbitals per atom (formed from 2s, 2px, and 2py atomic orbitals) and in
the z-direction a π-bonding orbital (formed from 2pz) directed perpendicular to the
layers. At higher energy lie the associated unfilled σ* and π* antibonding states; the
excitation of K-shell electrons involves both 1s → σ* and 1s → π* transitions, their
relative strengths depending on the angle Γ (Leapman et al., 1983).

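Using relativistic theory, Eqs. (3.169) and (3.170) become, for the angular
dependence of these two components (Souche et al., 1998; Radtke et al., 2006):

$$J_\pi \propto \frac{(\gamma^{-2}\theta_E\cos\Gamma - \theta_x\sin\Gamma)^2}{(\theta^2 + \gamma^{-2}\theta_E^2)^2} \tag{A.9}$$

$$J_\sigma \propto \frac{\theta^2 + \gamma^{-4}\theta_E^2 - (\gamma^{-2}\theta_E\cos\Gamma - \theta_x\sin\Gamma)^2}{(\theta^2 + \gamma^{-2}\theta_E^2)^2} \tag{A.10}$$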
Fig. A.3 Intensities Jπ, Jσ, and their sum, calculated from Eqs. (A.9), (A.10), and (A.11) for
Γ = 0 and 300 keV incident electrons, compared to the equivalent quantities calculated from
non-relativistic theory (γ = 1)


where θ² = θx² + θy². Considering the Γ = 0 case for simplicity: the angular widths
of both Jπ and Jσ are reduced as a result of the γ⁻² term in the denominators of
Eqs. (A.9) and (A.10), but whereas Jπ is reduced in amplitude, Jσ is increased; see
Fig. A.3. Therefore the angular width of the total (Jπ and Jσ) intensity is broadened,
because Jσ peaks at a non-zero angle (θ = θE/γ); see Fig. A.3.

For an isotropic material, Eqs. (A.9) and (A.10) give the total intensity as

$$J_{\rm tot} = J_\pi + J_\sigma \propto \frac{\theta^2 + \gamma^{-4}\theta_E^2}{(\theta^2 + \gamma^{-2}\theta_E^2)^2} \tag{A.11}$$


which is independent of specimen tilt but has a somewhat non-Lorentzian distribution
because of the γ⁻² and γ⁻⁴ factors. For E0 > 250 keV, the total intensity
peaks away from θ = 0, reflecting the larger relative contribution from the transverse
component, just as in Fig. A.1. Radtke et al. (2006) recorded energy-filtered scattering
patterns at the π- and σ-peak energies of graphite (Fig. 3.60b), using incident
energies of 200 and 300 keV. Although the differences were not large, the relativistic
equations were found to give better agreement with experiment.

Schattschneider et al. (2005) explain how longitudinal and transverse compo- 
nents of momentum transfer must be treated differently in the case of an anisotropic 
material. Due to relativistic contraction in the direction of motion, the electric field 
of a sufficiently fast electron is no longer spherically symmetric, resulting in a 
greater relative contribution from excitations transverse to the motion. In terms 
of the formulation of Fano (1956), this means that the longitudinal and transverse 
contributions cannot be added incoherently in anisotropic materials; the transition 
matrix is not a spherical tensor and interference terms in the dynamic form factor 
Table A.1 Magic angle θm in units of θE and in milliradians (for E = 284 eV)
as a function of incident energy, according to Schattschneider et al. (2005)

E0 (keV)   θm/θE   θm (mrad)
50         2.92    8.68
100        2.25    3.48
150        1.79    1.91
200        jae     a
300        1.22    0.60
400        0.78    0.35
500        0.61    0.23
1000       0.25    0.05


cannot be neglected. These terms are of the order of 20% for E0 = 200 keV and
proportional to (v/c)². However, the longitudinal (z) term is positive whereas the
perpendicular term is negative; they exactly cancel for an isotropic material, leaving
only a small relativistic correction, proportional to (v/c)⁴.

A further consequence of this interference effect is a small value of the magic
angle (Section 3.10). A spectrum recorded with a collection angle equal to θm
is identical to that recorded from a random-polycrystalline (orientation-averaged)
isotropic material because in this condition the π and σ interference terms exactly
cancel. Relativistic theory (Jouffrey et al., 2004; Schattschneider et al., 2005; Sun
and Yuan, 2005) predicts a rapid fall in θm with increasing incident energy; see
Table A.1.
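
The two columns of Table A.1 are related through the characteristic angle θE = E/(2γT) = E/(γm0v²). The sketch below computes θE for E = 284 eV and converts the tabulated θm/θE ratios to milliradians; only rows whose entries are legible in the table are used, and the function name is hypothetical.

```python
M0C2_KEV = 511.0  # electron rest energy in keV

def characteristic_angle_mrad(energy_loss_eV, e0_keV):
    """Characteristic angle theta_E = E/(gamma*m0*v^2), in milliradians."""
    gamma = 1.0 + e0_keV / M0C2_KEV
    beta_sq = 1.0 - 1.0 / gamma**2
    gamma_m0_v2_eV = gamma * M0C2_KEV * 1.0e3 * beta_sq
    return 1.0e3 * energy_loss_eV / gamma_m0_v2_eV

# E0 (keV) -> (theta_m/theta_E, theta_m in mrad) as listed in Table A.1
table_rows = {50: (2.92, 8.68), 100: (2.25, 3.48), 150: (1.79, 1.91),
              400: (0.78, 0.35), 500: (0.61, 0.23), 1000: (0.25, 0.05)}

magic_angles_mrad = {}
for e0, (ratio, listed) in table_rows.items():
    magic_angles_mrad[e0] = ratio * characteristic_angle_mrad(284.0, e0)
    print(f"E0 = {e0:4d} keV: theta_m = {magic_angles_mrad[e0]:.2f} mrad "
          f"(table: {listed:.2f})")
```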


Appendix B 
Computer Programs 


The computer codes discussed in this appendix generate spectra, process spectral 
data, or calculate scattering cross sections or mean free paths. They are designed as 
a supplement to DigitalMicrograph scripts (Mitchell and Schaffer, 2005) and can
be downloaded from http://tem-eels.com or from http://tem-eels.ca 

All are written in MATLAB script. A program to convert DigitalMicrograph data 
files into MATLAB format is available. As these programs may be updated from 
time to time, the description that follows in this appendix may not be exact. 


Each program can be run in several ways: 


1. From the MATLAB Editor window, via the Run (F5) command of the Debug 
menu, or by clicking on the green triangle and entering input variables as 
directed; 

2. By typing the appropriate ProgramName(InputParameters) as given at the begin- 
ning of each program listing and as discussed below, this option being convenient 
for repetitive use with minor changes to the input parameters; 

3. Many of the programs can be run using the free Octave software available from 
http://www.gnu.org/software/octave/download.html. Typically they are run by 
typing the program name in a command window, then entering input parameters 
as requested. 


B.1 First-Order Spectrometer Focusing 


PRISM calculates first-order focusing properties of a single-prism spectrometer, 
based on the matrix method: Eqs. (2.8), (2.9), (2.10), and (2.11). Beam displace- 
ments (x,y) and angles (x’, y’) in radians are calculated for an entrance cone of 
semi-angle 1 mrad and a specified distance v relative to the prism exit point. The 
program then changes the image distance v to correspond to the calculated dis- 
persion plane (where x = 0) and (x, y, x’, y’) values are recalculated. Second-order 
matrices are not calculated; higher order focusing can usually be corrected by the 
use of external multipoles. 


R.F. Egerton, Electron Energy-Loss Spectroscopy in the Electron Microscope, 405 
DOI 10.1007/978-1-4419-9583-4_7, © Springer Science+Business Media, LLC 2011 




Input is of the form: Prism(u, eps1, eps2, K1, g, R, phi, v), with symbols as used in 
Section 2.2. An input of the form Prism(620, 16, 47, 0.4, 1.25, 100, 90, 90) produces 
the following output: 


For v = 90, x = -0.0121215, x’ = -0.00322115, y = -0.0480911, y’ = -0.00742323
For v = 86.2369, x = 0, x’ = -0.00322115, y = -0.0201568, y’ = -0.00742323


The dimensions u, v, R, and g can be expressed in any consistent set of units
(mm or cm, for example). K1 = 0 implies a SCOFF approximation and gives results
equivalent to Eqs. (2.4) and (2.5); K1 ≈ 0.4 corresponds to mirror planes and
tapered magnet edges, for which the x-focusing is unchanged and the y-focusing
is slightly weaker.
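
The relation between the two lines of sample output can be checked by hand: in
the field-free region after the prism, each displacement changes by (angle ×
drift distance), so the dispersion plane lies where the x-displacement reaches
zero. The following Python sketch is not part of the PRISM program; the
function name propagate is hypothetical and the numbers are simply those of the
sample output.

```python
def propagate(x, xp, y, yp, dv):
    """Drift a ray by a distance dv in field-free space (small-angle form)."""
    return x + xp * dv, xp, y + yp * dv, yp

# Values listed for Prism(620, 16, 47, 0.4, 1.25, 100, 90, 90) at v = 90
v = 90.0
x, xp, y, yp = -0.0121215, -0.00322115, -0.0480911, -0.00742323

dv = -x / xp                   # drift that brings x to zero
v_disp = v + dv                # image distance of the dispersion plane
x2, xp2, y2, yp2 = propagate(x, xp, y, yp, dv)

print(f"v = {v_disp:.4f}, x = {x2:.1e}, y = {y2:.7f}")
```

To within rounding of the printed digits, this reproduces the image distance
and y-displacement listed on the second output line.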


B.2 Cross Sections for Atomic Displacement and High-Angle 
Elastic Scattering 


SIGDIS calculates elastic cross sections for bulk (knock-on) atomic displacement
or electron-induced surface sputtering. It uses angle-integrated analytical formulas
based on the Rutherford and the McKinley-Feshbach-Mott approximations. For
Z > 35, the Rutherford value may be the more accurate (depending on the values of
Ed and E0). Input is of the form: SigDis(Z, A, Ed, E0).

The maximum energy transfer Emax is obtained from Eq. (3.12b). For bulk
displacement in common metals, the minimum transfer Emin can be taken as the
displacement energy Ed, which can be evaluated by applying Eq. (3.12c) to the
threshold energies tabulated by King et al. (1987).

For surface sputtering, cross sections are calculated for a minimum transfer
Emin equal to the displacement energy Ed (spherical escape potential) and also for
Emin = (Ed Emax)^(1/2) (planar surface potential), as discussed in Section 3.1.6.
The displacement energy Ed can be taken as the sublimation energy Esub or as
(5/3)Esub, the latter being probably more accurate for metals (Egerton et al., 2010).

SIGADF calculates Rutherford and McKinley-Feshbach-Mott cross sections
for elastic scattering between two specified angles, θmin and θmax, appropriate to
STEM measurements or imaging with a HAADF detector. Input is of the form:
SigADF(Z, A, qn, qx, E0), where qn and qx represent the minimum and maximum
angles. A warning is given if θmin is below the Lenz screening angle
θ0 = Z^(1/3)/(k0 a0), indicating that screening and diffraction effects may make
the result inaccurate.
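
As an illustration of the screening-angle test, the Python sketch below
evaluates θ0 = Z^(1/3)/(k0 a0) using the relativistic electron wavenumber k0.
It is a minimal sketch rather than the SIGADF code: the function names are
hypothetical and the physical constants are standard values entered by hand.

```python
import math

HC_EV_NM = 1239.842      # h*c in eV nm
MC2_EV = 510998.95       # electron rest energy in eV
A0_NM = 0.0529177        # Bohr radius in nm

def wavenumber(e0_kev):
    """Relativistic electron wavenumber k0 = 2*pi/wavelength, in nm^-1."""
    e0 = e0_kev * 1e3
    wavelength = HC_EV_NM / math.sqrt(e0 * (e0 + 2.0 * MC2_EV))
    return 2.0 * math.pi / wavelength

def lenz_angle_mrad(z, e0_kev):
    """Lenz screening angle theta0 = Z^(1/3) / (k0 * a0), in mrad."""
    return 1e3 * z ** (1.0 / 3.0) / (wavenumber(e0_kev) * A0_NM)

def inner_angle_ok(z, e0_kev, theta_min_mrad):
    """True if theta_min is at or above the screening angle (no warning needed)."""
    return theta_min_mrad >= lenz_angle_mrad(z, e0_kev)

print(round(lenz_angle_mrad(14, 100.0), 1), inner_angle_ok(14, 100.0, 50.0))
```

For silicon at 100 keV the screening angle comes out at a few tens of mrad, so
a typical HAADF inner angle of 50 mrad or more would pass this check.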


B.3 Lenz-Model Elastic and Inelastic Cross Sections


LENZPLUS calculates cross sections of elastic and inelastic scattering (integrated
over all energy loss) for an element of chosen atomic number, based on the atomic
model of Lenz (1954). It uses Eqs. (3.5) and (3.15) for the differential cross sections
at a scattering angle θ, Eqs. (3.6), (3.7) and a more exact version of Eq. (3.16) for
the cross sections integrated up to a scattering angle β, and Eqs. (3.8) and (3.17)
for the total cross sections (large β). Fractions F of the elastic and inelastic
scattering accepted by the angle-limiting collection aperture are also evaluated and
are likely to be more accurate than the absolute cross sections. The elastic-scattering
values are not intended to apply to crystalline specimens. Input is of the form
LenzPlus(E0, Ebar, Z, beta, toli), where toli denotes the inelastic scattering
parameter t/λi, which is used only in the second half of the program; a value of 0
terminates the program halfway.

To provide inelastic cross sections, the Lenz model requires a mean energy loss 
Ebar, a different average from that involved in the formula for mean free path 
Eq. (5.2). Following Koppe, Lenz (1954) used Ebar = J/2, where J (≈ 13.5 Z) is
the atomic mean ionization energy. This option is invoked by entering Ebar = 0 in
the program. From Hartree-Slater calculations, Inokuti et al. (1981) give the mean
energy per inelastic collision for elements up to strontium; values are in the range 
20-120 eV and have an oscillatory Z-dependence that reflects the electron-shell 
structure, which is appropriate for atoms but less so for solids. 

If provided with a value of t/λi (where λi is the total-inelastic mean free path,
integrated over all scattering angles), LENZPLUS calculates the relative intensities
of the unscattered, elastically scattered, inelastically scattered, and (elastic +
inelastic) components accepted by the collection aperture, including scattering up
to fourth order and allowing for the increasing width of the plural-scattering
angular distributions, as described by Eqs. (3.97), (3.108), and (3.110). The
expression ln(It/I0) is calculated for comparison with t/λi(β) to assess the effect
of this angular broadening.


B.4 Simulation of a Plural-Scattering Distribution 


SPECGEN generates a series of Gaussian-shaped “plasmon” peaks, each of the form
exp[-(1.665 E/ΔE_n)^2], whose integrals satisfy Poisson statistics and whose full
widths at half maximum are given by


(ΔE_n)^2 = (ΔE)^2 + n (ΔE_p)^2 (B.1)


Here ΔE is the instrumental FWHM and ΔE_p represents the natural width of
the plasmon peak. This plural-scattering distribution (starting at an energy -ez
and with the option of adding a constant background back) is written to the file
SPECGEN.PSD; the single-scattering distribution (with first channel corresponding
to E = 0) is written to SPECGEN.SSD to allow a direct comparison with the results
of deconvolution.
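
The construction of the noise-free plural-scattering distribution can be
sketched in a few lines of Python. This is an illustrative sketch, not the
SPECGEN code: the function name plural_scattering and the max_order cutoff are
assumptions, the arguments borrow the SpecGen parameter names, and background
and noise are omitted. The area of the nth peak is taken as a0 (t/λ)^n/n!, which
is the Poisson law expressed relative to the zero-loss integral a0.

```python
import math

GAUSS_K = 2.0 * math.sqrt(math.log(2.0))    # = 1.665: exp[-(K*E/W)^2] has FWHM W
GAUSS_AREA = math.sqrt(math.pi) / GAUSS_K   # area of that Gaussian for unit height and FWHM

def plural_scattering(ep, wp, wz, tol, a0, ez, epc, nd, max_order=15):
    """Sum of Gaussian peaks centred at n*ep; n = 0 is the zero-loss peak.

    ep: plasmon energy (eV); wp: natural plasmon FWHM (eV); wz: instrumental
    FWHM (eV); tol: t/lambda; a0: zero-loss integral (counts); ez: energy offset
    of the first channel (it lies at -ez); epc: eV/channel; nd: channel count.
    """
    spectrum = [0.0] * nd
    for n in range(max_order + 1):
        area = a0 * tol ** n / math.factorial(n)    # Poisson weighting of order n
        fwhm = math.sqrt(wz ** 2 + n * wp ** 2)     # Eq. (B.1)
        height = area / (GAUSS_AREA * fwhm)         # counts per eV at the peak centre
        for i in range(nd):
            e = i * epc - ez - n * ep               # energy measured from the peak centre
            spectrum[i] += height * epc * math.exp(-(GAUSS_K * e / fwhm) ** 2)
    return spectrum

psd = plural_scattering(ep=15, wp=2, wz=1, tol=0.5, a0=1000, ez=5, epc=0.1, nd=1500)
```

Summing all channels of psd and dividing by a0 should give a value close to
exp(t/λ), which provides a quick consistency check on the Poisson weighting.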

Input is of the form: SpecGen(ep, wp, wz, tol, a0, ez, epc, nd, back, fback, cpe).
The program approximates noise in an experimental spectrum in terms of two
components. Electron-beam shot noise (snoise) is taken as the square root of the
number of counts (for each order of scattering) but multiplied by a factor fpoiss,
taken as cpe^(1/2), where cpe is the number of counts per beam electron, assuming
Poisson noise statistics. For an electron-counting system, fpoiss = cpe = 1;
otherwise the value may be greater or less than 1, depending on the sensitivity of
the scintillator. Background noise (bnoise), which might represent electronic noise
of the electron detector, is taken as the background level multiplied by a factor
fback. Stochastic numbers rndnum (mean amplitude = 1) are generated as rounding
errors of arbitrary real numbers and are not truly random; they repeat exactly each
time the program is run (not necessarily a disadvantage). Setting cpe = 0 = fback
provides a noise-free spectrum.

The output files are also plotted.


B.5 Fourier-Log Deconvolution 


The program FLOG calculates a single scattering distribution based on Eqs. (4.10) 
and (4.11). Input is of the form FLOG(infile,fwhm2). PSD data generated by 
SPECGEN can be used as a test infile. 

Data values are read from the chosen two-column infile, assumed to consist of 
pairs of floating point numbers representing energy loss and spectral intensity. The 
instrumental background level back is estimated from the first five intensity (y) 
values; if these points are not representative, background subtraction should be per- 
formed manually before using FLOG. The eV/channel value epc is obtained from 
the first and fifth energy (x) values; the zero-loss channel number nz is found by 
detecting the first y-value maximum. The separation point nsep between the elastic 
and inelastic data is taken as the subsequent minimum in J(E)/E; the 1/E weighting 
helps to discriminate against glitches on the zero-loss profile. The zero-loss sum a0 
is taken as the sum of channel counts (above background) up to channel nsep. Any 
discontinuities in the data (e.g., obtained through separate CCD readouts) must be 
removed by prior editing. 
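
The parameter estimates described above can be sketched as follows. This is a
simplified Python illustration, not the FLOG code: the function name
analyse_low_loss is hypothetical, and the zero-loss channel is taken as the
global maximum of the data, which is an assumption that only matches the "first
maximum" rule when the zero-loss peak is the strongest feature.

```python
def analyse_low_loss(energy, counts):
    """Estimate back, epc, nz, nsep and a0 from two-column low-loss data."""
    back = sum(counts[:5]) / 5.0             # background from the first five y-values
    epc = (energy[4] - energy[0]) / 4.0      # eV/channel from first and fifth x-values
    # Zero-loss channel: global maximum (assumed to be the zero-loss peak).
    nz = max(range(len(counts)), key=lambda i: counts[i])
    # Separation point: first local minimum of J(E)/E above the zero-loss channel.
    weighted = [counts[i] / ((i - nz) * epc) for i in range(nz + 1, len(counts))]
    nsep = len(counts) - 1                   # fallback if no minimum is found
    for k in range(1, len(weighted) - 1):
        if weighted[k] <= weighted[k - 1] and weighted[k] < weighted[k + 1]:
            nsep = nz + 1 + k
            break
    a0 = sum(c - back for c in counts[: nsep + 1])   # zero-loss sum above background
    return back, epc, nz, nsep, a0
```

With noisy data the first local minimum of J(E)/E may be a statistical
fluctuation, so some smoothing before the search may be advisable in practice.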

The data are transferred to the nn elements of an array d, subtracting any back- 
ground and shifting the spectrum, so that the first channel corresponds to zero energy 
loss. The data are extrapolated to the end of the array by fitting the last 10 data 
channels to an inverse power law, using Eq. (4.51), then a cosine bell function is 
subtracted to make the data approach zero at the end of the array without a disconti- 
nuity in intensity or slope at the last recorded data point d(nfin). The zero-loss peak 
is copied from array d and the discontinuity at the separation point (nsep) is removed
by subtracting a cosine bell function, a procedure that preserves the zero-loss sum 
as a0. 

An effective width fwhm1 of the zero-loss peak is estimated from the peak height 
and area, taking the peak shape to be Gaussian. The program then asks for a choice 
of reconvolution function: either the zero-loss peak (if zero is entered) or a Gaussian 
peak of specified width fwhm2. If this width is the same as fwhm1, there is no peak 
sharpening and no noise amplification, but peak-shape distortion due to an asym- 
metric resolution function is corrected. Discrete Fourier transforms are computed, 
replacing the original data in d and z. The Fourier coefficients are manipulated 
according to Eq. (4.10), allowing scattering parameters up to t/λ = π to be
accommodated. The higher-frequency coefficients are attenuated to avoid noise
amplification, using a Gaussian filter function or by multiplying by the Fourier trans- 
form of Z(E). An inverse transform is then computed, giving the single-scattering 
distribution (without zero-loss peak) as the real coefficients in array d and in the 
output file Flog.ssd. Prior division by the number nn of real data points ensures that 
the output is correctly scaled. 
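
The core of the Fourier-log step can be illustrated with a short Python sketch.
It assumes that Eq. (4.10) has the standard Fourier-log form
s(ν) = I0 ln[j(ν)/z(ν)], where j and z are the transforms of the spectrum and
of the zero-loss peak; the function names are hypothetical, a plain O(n²)
transform replaces the FFT, and the noise filter and reconvolution function are
omitted.

```python
import cmath
import math

def dft(a, sign=-1):
    """Plain O(n^2) discrete Fourier transform (sign=+1: unscaled inverse)."""
    n = len(a)
    return [sum(a[k] * cmath.exp(sign * 2j * math.pi * m * k / n) for k in range(n))
            for m in range(n)]

def fourier_log(spectrum, zero_loss, a0):
    """Single-scattering distribution from s(v) = a0 * ln[j(v) / z(v)]."""
    j = dft(spectrum)
    z = dft(zero_loss)
    # cmath.log returns the principal value, whose imaginary part is limited
    # to the range -pi..pi; this is the origin of the limit t/lambda < pi.
    s = [a0 * cmath.log(jv / zv) for jv, zv in zip(j, z)]
    n = len(spectrum)
    return [v.real / n for v in dft(s, sign=+1)]   # division by n rescales the inverse
```

Because the complex logarithm is evaluated as a principal value, the result is
only unambiguous while the phase of j(ν)/z(ν) stays within ±π, which is
guaranteed when t/λ is below π.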

The program FLOGS works for thicker specimens (t/λ > π), using the procedure
described by Spence (1979). Channels nn/2 to nn of the array d are set to zero before 
computing the Fourier transform. After applying Eq. (4.20) and taking an inverse 
transform, channels nn/2 to nn contain a mirror image of the single-scattering dis- 
tribution, which is discarded before writing the first nn/2 channels to the output file 
FLOGS.SSD. 


B.6 Maximum-Likelihood Deconvolution 


RLUCY uses the Richardson-Lucy method of maximum-likelihood deconvolution
and can sharpen spectral data (improving the energy resolution) to an extent
dependent on the noise content of the original spectrum. It is based on C-code kindly
supplied by Lijun Wu and can be tested using SPECGEN.PSD as input.

For spectral sharpening, the kernel is a zero-loss peak, supplied either as a sep- 
arate file or stripped from the low-loss spectrum. In the case of core-loss data, the 
kernel can be the zero-loss peak (for spectral sharpening only) or the low-loss spec- 
trum, in which case plural (plasmon + core-loss) scattering is removed, in addition 
to any sharpening. To avoid end-of-range artifacts, the intensity at the start and end 
of the data array should be roughly matched, although this criterion is less impor- 
tant than for Fourier deconvolution methods. In the case of a background-subtracted 
ionization edge, the matching can be achieved by extrapolating the intensity to zero 
at high energy loss. For an edge with background, it is sufficient to adjust the energy 
width of the data, so that the intensities at both ends of the data array are equal 
(Egerton et al., 2008). 

The program provides the option of dividing input data by a gain-reference spec- 
trum and subtracting either a dark-current spectrum or a constant dark current, if 
these operations were not already performed. The number N of iterations must be 
specified; N = 15 is typical but several values should be tested. The energy resolu- 
tion in the data improves with increasing N, and an estimate of the resolution (for 
a given N) can be obtained by using the same low-loss spectrum as both data and 
kernel (Egerton et al., 2008). 
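
For readers unfamiliar with the method, the textbook Richardson-Lucy iteration
is sketched below in Python. This is not a transcription of RLUCY: the function
names are hypothetical, circular convolution is used for brevity (which is why
matched intensities at the two ends of the array matter), and gain or
dark-current corrections are omitted.

```python
def circ_conv(a, kernel):
    """Circular convolution; kernel[0] is the zero-shift (peak) element."""
    n = len(a)
    return [sum(a[(i - m) % n] * kernel[m] for m in range(n)) for i in range(n)]

def circ_corr(a, kernel):
    """Circular correlation, i.e. convolution with the mirrored kernel."""
    n = len(a)
    return [sum(a[(i + m) % n] * kernel[m] for m in range(n)) for i in range(n)]

def richardson_lucy(data, kernel, n_iter=15):
    """Richardson-Lucy iteration for non-negative data; kernel is normalized here."""
    total = sum(kernel)
    kernel = [k / total for k in kernel]
    estimate = list(data)                    # start from the recorded spectrum
    for _ in range(n_iter):
        blurred = circ_conv(estimate, kernel)
        ratio = [d / b if b > 0 else 0.0 for d, b in zip(data, blurred)]
        correction = circ_corr(ratio, kernel)
        estimate = [e * c for e, c in zip(estimate, correction)]
    return estimate
```

Each iteration preserves the total number of counts and keeps the estimate
non-negative, two properties that distinguish the method from Fourier-based
sharpening.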


B.7 Drude Simulation of a Low-Loss Spectrum 


The program DRUDE calculates a single-scattering plasmon-loss spectrum for
a specimen of a given thickness tnm (in nm), recorded with electrons of a
specified incident energy e0 by a spectrometer that accepts scattering up to a
specified collection semi-angle beta. It is based on the extended Drude model
(Section 3.3.2), with a volume energy-loss function elf in accord with Eq. (3.64) and
a surface-scattering energy-loss function srelf as in Eq. (4.31). Retardation effects
and coupling between the two surface modes are not included. The surface term can
be made negligible by entering a large specimen thickness (tnm > 1000).

Surface intensity srfint and volume intensity volint are calculated from
Eqs. (4.31) and (4.26), respectively. The total spectral intensity ssd is written to
the file DRUDE.SSD, which can be used as input for KRAKRO. These intensities are
all divided by I0, to give relative probabilities (per eV). The real and imaginary parts
of the dielectric function are written to DRUDE.EPS and can be used for comparison
with the results of Kramers-Kronig analysis (KRAKRO.DAT).

Written output includes the surface-loss probability Ps, obtained by integrating
srfint (a value that relates to two surfaces but includes the negative begrenzungs
term), for comparison with the analytical integration represented by Eq. (3.77). The
volume-loss probability Pv is obtained by integrating volint and is used to
calculate the volume plasmon mean free path (lam = tnm/Pv). The latter is listed and
compared with the MFP obtained from Eq. (3.44), which represents analytical
integration assuming a zero-width plasmon peak. The total probability (Pt = Pv + Ps)
is calculated and used to evaluate the thickness (lam × Pt) that would be given by
the formula t/λ = ln(It/I0), ignoring the surface-loss probability. Note that Pv will
exceed 1 for thicker specimens (t/λ > 1), since it represents the probability of
plasmon scattering relative to that of no inelastic scattering.

The command-line usage is Drude(ep, ew, eb, epc, beta, e0,tnm,nn), where ep is the 
plasmon energy, ew the plasmon width, eb the binding energy of the electrons (0 for 
a metal), and nn is the number of channels in the output spectrum. An example of 
the output is shown in Fig. B.1a,b. 
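
The dielectric part of such a calculation can be sketched in Python as follows.
The sketch assumes the usual Drude-Lorentz form
ε = 1 + ep²/(eb² - E² - iE·ew), taken here to correspond to the extended Drude
model of Section 3.3.2; the function name is hypothetical, and the angular and
thickness factors of Eqs. (4.26) and (4.31) are not included.

```python
def drude_dielectric(e, ep, ew, eb=0.0):
    """Return (eps1, eps2, elf) at energy loss e (eV); requires e > 0 if eb = 0.

    ep: plasmon energy, ew: plasmon width, eb: binding energy (all in eV).
    elf is the volume energy-loss function Im(-1/eps).
    """
    eps = 1.0 + ep ** 2 / complex(eb ** 2 - e ** 2, -e * ew)
    elf = (-1.0 / eps).imag
    return eps.real, eps.imag, elf

# Energy-loss function for the parameters of Fig. B.1 (ep = 15, ew = 2, eb = 0)
epc, nn = 0.1, 300
elf_curve = [drude_dielectric((i + 1) * epc, 15.0, 2.0)[2] for i in range(nn)]
```

For a metal (eb = 0) the energy-loss function at E = ep equals ep/ew, which
gives a simple check on the peak height.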


B.8 Kramers-Kronig Analysis


The program KRAKRO calculates the real part ε1(E) and imaginary part ε2(E) of the
dielectric function, as well as the specimen thickness t and mean free path λ(β) for
inelastic scattering. It employs the Fourier procedure for Kramers-Kronig analysis
described by Johnson (1975), but using fast-Fourier transforms. As input, it requires
a single-scattering distribution with no zero-loss peak but with its first channel
corresponding to E = 0 (such as DRUDE.SSD). The program also requires values
of the zero-loss integral a0, incident-electron energy e0, collection semi-angle beta,
and optical refractive index ri. In the case of a metallic specimen, any large value
(>20) can be entered for ri.

The single-scattering intensity S(E) is read and transferred to the arrays ssd and
d, each of adequate length nn = 2^k, where k is an integer. Assuming a Lorentzian
angular distribution, an aperture correction is applied to S(E) to make the result
proportional to Im(-1/ε); the proportionality constant rk = K is evaluated by utilizing
the Kramers-Kronig sum rule, Eq. (4.27). Since K = I0 t/(π a0 m0 v^2) according to
Eq. (4.26), this leads to an initial estimate of specimen thickness and mean free path,
evaluated as λ = t/(t/λ) = t I0/I1, where I1 is the integral of S(E). Then Im(-1/ε)
is copied to the array di before being converted to its Fourier transform, the
imaginary part of d becoming the sine transform of Im(-1/ε). With appropriate scaling
(1/nn) these coefficients are converted into the real part of d and an inverse (cosine)
transformation yields Re(1/ε) - 1, accompanied by its reflection about the midrange
(nn/2) axis, due to aliasing. Taking the high-energy tail to be proportional to E^-2,
this energy dependence is subtracted from the lower energy (< nn/2) data and used
to extrapolate the high-energy values, a procedure that becomes less critical if nn is
made considerably larger than the minimum required value.

Fig. B.1 (a) Dielectric data generated by Drude for ep = 15, ew = 2, eb = 0, epc = 0.1,
e0 = 100, beta = 5, nn = 300, tnm = 50. (b) Drude bulk, surface, and total intensities,
given as dP/dE in eV^-1. (c) Dielectric data generated in the first iteration by KraKro
with a0 = 1, ri = 1000, and δ = 0.1, giving t = 56 nm (t = 47 nm in second iteration).
(d) KraKro total and surface-mode intensities (in eV^-1) generated in first iteration

Knowing Re(1/ε) and Im(-1/ε), the real part eps1 and imaginary part eps2 of
ε are computed, followed by the surface energy-loss function srfelf and the
surface-scattering intensity srfint, these data being written to the output file
KRAKRO.DAT. The written output is truncated to the original range (nlines) of the
input data. Calculation of the surface-mode scattering is based on Eq. (4.31), which
assumes clean (unoxidized) and smooth surfaces that are perpendicular to the incident
beam, neglecting coupling between the surfaces (1/Rc = 1 + ε). The volume-loss
intensity, obtained by subtracting srfint from ssd, is then renormalized by re-applying
the K-K sum rule, leading to revised estimates for the specimen thickness and inelastic
mean free path. Kramers-Kronig analysis is then repeated to yield revised values of
the dielectric data.

By setting nloops > 2, further iterations are possible. Whether convergence is
obtained depends on the behavior of the data at low energy loss (E < 5 eV). To aid
stability, E is replaced by E + δ in the expression for the surface-scattering angular
dependence adep, thereby avoiding a non-zero value of srfint at E = 0. For thicker
samples, δ = 0.1 eV is sufficient but if surface losses are predominant, δ ≈ 0.5 eV
may be needed to ensure iteration stability (Alexander et al., 2008). An example of
the data output is given in Fig. B.1(c,d).


B.9 Kröger Simulation of a Low-Loss Spectrum


The program KROEGER evaluates the differential scattering probability d²P/dΩ dE
in the low-loss region for a chosen incident energy, specimen thickness, collection
semi-angle, and eV/channel, based on Eq. (3.84a). As input it requires (E, ε1, ε2)
data, as provided by DRUDE.EPS for example, together with values of incident
energy, specimen thickness, and collection semi-angle. The calculation procedure
is similar to that used in the Cerenkov script written for DigitalMicrograph by
Wilfried Sigle.

As output, a three-dimensional contour plot is generated, including all terms in 
Eq. (3.84a), and a separate plot that represents only the first (volume) term, equiv- 
alent to Eq. (3.70). Other two-dimensional plots represent the angular dependence 
at selected energy losses and the energy-loss spectrum at a particular (e.g., zero) 
scattering angle, these selected values being specified within the program. The
probability is integrated (over the input value of angular range β) to give dP/dE. The
integration uses a MATLAB function quadl, which may need to be replaced if the
program is run using Octave.

A separate program KROEGEREBPLOTS demonstrates some of the physics
behind retardation effects by plotting data for 10 different thicknesses and
specified β, E, and E0, using as input (for example) DRUDE.EPS or silicon data
(KROEGEREBPLOTS_SI.DAT). Angular dependences are plotted for a fixed E, for
comparison with fig. 3 of Erni and Browning (2008), where E = 3 eV and
E0 = 200 keV. Energy dependences of dP/dE are then plotted, integrated up to
a scattering angle β for comparison with Fig. 3.26b (fig. 4a of Erni and Browning),
where E0 = 300 keV, β = 2.1 mrad.


B.10 Core-Loss Simulation 


COREGEN constructs an idealized core-loss profile based on a power-law (A E^-r)
single-scattering distribution but with plural (plasmon + core-loss) scattering
included. It uses Eq. (3.117), the plasmon-loss spectrum being taken to be a series of
delta functions obeying Poisson statistics. The plasmon-loss and core-loss spectra
are displayed and written to the files COREGEN.CORE and COREGEN.LOW, which
can be used as input for testing the deconvolution program FRAT.

EDGEGEN adds a power-law background (with the same exponent r) to the 
COREGEN output, giving an output similar to Fig. 3.33. The jump ratio of the edge 
(ratio of intensities just after/before the threshold) is specified as an input
parameter. To avoid artifacts, the maximum order n of the plasmon-scattering
contribution to the background is limited, ensuring that the A E^-r formula is not
applied for E < 30 eV. The plasmon-loss and core-loss spectra are displayed and
written to
EDGEGEN.CORE and EDGEGEN.LOW. 
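
The idea behind the core-loss simulation can be sketched in Python: the
power-law edge is convolved with a series of delta functions at multiples of
the plasmon energy, weighted by Poisson factors. This is a hedged sketch and not
the COREGEN code; the function name, the argument names, and the exp(-t/λ)
normalization of the weights are assumptions.

```python
import math

def core_loss_profile(ek, ep, r, tol, epc, nd, a=1.0, max_order=10):
    """Power-law edge a*E^-r (E >= ek) with plural plasmon scattering added.

    ek: edge threshold (eV); ep: plasmon energy (eV); r: power-law exponent;
    tol: t/lambda; epc: eV/channel; nd: channels (first channel is E = ek).
    """
    profile = [0.0] * nd
    for n in range(max_order + 1):
        weight = math.exp(-tol) * tol ** n / math.factorial(n)   # Poisson factor
        for i in range(nd):
            e_single = ek + i * epc - n * ep   # loss remaining for the core excitation
            if e_single >= ek:                 # below threshold nothing is contributed
                profile[i] += weight * a * e_single ** (-r)
    return profile

edge = core_loss_profile(ek=284.0, ep=15.0, r=3.0, tol=0.5, epc=1.0, nd=50)
```

Each additional order of plasmon scattering adds a delayed copy of the edge, so
the simulated profile shows a step at each multiple of ep above the threshold.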


B.11 Fourier Ratio Deconvolution 


Using the Fourier ratio method described in Section 4.3.2, the program FRAT 
removes plural scattering from an ionization edge whose background has previously 
been subtracted. It requires a low-loss spectrum, recorded from the same region of 
specimen at the same eV/channel, but this spectrum need not be contiguous with the 
core-loss region or match it in terms of relative intensity. The method is therefore a 
more practical alternative to FLOG in cases where the core-loss and low-loss spectra 
are obtained in separate readouts. Other advantages are that the zero-loss peak does 
not need to be extracted from the low-loss spectrum (which involves some approxi- 
mation) and that the specimen thickness is in principle unlimited (there are no phase 
ambiguities in the Fourier components). 

The low-loss spectrum is read as two-column (x,y) data from a named input file 
and the zero-loss channel is taken as the first maximum. The first minimum is found 
in order to estimate the zero-loss integral a0 and the energy resolution (obtained 
from a0 and the zero-loss peak height, assuming a Gaussian shape). The spectrum 
is transferred to a working array d, containing nn channels where nn is the next 
power of 2 that exceeds the actual number of low-loss data channels. 

Data are shifted within d, so that the first channel represents zero loss. Any 
background (average of the first five channels) is subtracted and the intensity is 
extrapolated to zero at the last channel, using a power-law extrapolation with cosine 
bell termination. Then the left half of the zero-loss peak is added to the end channels 
of the array. The energy resolution (FWHM of the zero-loss peak) is displayed, to 
serve as a guide when specifying the width of the reconvolution function; smaller 
widths lead to peak sharpening but with a severe noise penalty, as explained in 
Section 4.3.2. Even without sharpening, the effect of any tails on the zero-loss 
peak (due, for example, to the detector point-spread function) can be successfully 
removed from core-loss data. 

The core-loss spectrum is read into odd elements of an array c and extrapo- 
lated to zero in the same way as d. After taking Fourier transforms, the Fourier 
coefficients are processed according to Eq. (4.38), using a Gaussian reconvolution
function gauss. If the coefficient of this function is the zero-loss integral a0,
plural scattering is subtracted from the ionization-edge intensity; if the coefficient
is changed to the total integral (at) of the low-loss spectrum, the core-loss SSD will 
have the same integral as the original edge, as needed for absolute elemental analysis 
of thick specimens (Wong and Egerton, 1995). 


B.12 Incident-Convergence Correction 


The program CONCOR2 evaluates the factor F1 by which inelastic intensity (at
energy loss E and recorded using a collection semi-angle β) is reduced as a result of
the convergence of the electron probe (semi-angle α). The program also evaluates
the factor F2 (for use in absolute quantification) and the effective collection angle
β* defined by Eq. (4.73). For inner-shell scattering, the energy loss E can be taken
as the edge energy Ek or (better) as an average energy loss (Ek + Δ/2) within the
integration window.

CONCOR2 uses an analytical formula (Scheinfein and Isaacson, 1984) that
assumes that the incident beam intensity per steradian is constant up to the angle α
and that the angular distribution of the inelastic scattering remains Lorentzian up to
an angle equal to α + β.

When analyzing for two elements, a and b, incident beam convergence is taken
into account by multiplying the areal density ratio Na/Nb, derived from Eq. (4.66),
by F1b/F1a (or by F2b/F2a). If the absolute areal density Na of an element a is
being calculated from Eq. (4.65), the result should be divided by F2a. For α < β,
F2 = F1; for α > β, F2 is larger than F1 (and may exceed unity; see Fig. 4.19)
because the collection angle cuts off part of the low-loss angular cone. As an
alternative to applying the correction factors F1 or F2, incident beam convergence can
be incorporated by computing each ionization cross section for the effective
collection angle bstar, which is a function of energy loss and therefore different for
each element.


B.13 Hydrogenic K-Shell Cross Sections 


The program SIGMAK3 uses a hydrogenic approximation for the generalized oscil- 
lator strength, Eqs. (3.125), (3.126), and (3.127), to calculate differential and 
integrated cross sections and dipole oscillator strengths (f0) for K-shell ionization. 
Unlike the original SIGMAK (Egerton, 1979) and SIGMAK2 programs, the reduction 
in effective nuclear charge (due to screening by the second 1s electron) is taken 
as 0.50 rather than the value of 0.3125 calculated by Zener (1930) for first-row 
elements, the aim being to provide a closer match to EELS, photoabsorption, and 
Hartree-Slater data (Egerton, 1993). The effect of this change becomes more
significant at low atomic number: f(100 eV) is reduced from 0.46 to 0.42 for oxygen
and from 2.02 to 1.58 for lithium.

Relativistic kinematics is employed, based on Eqs. (3.139), (3.140), (3.144), and
(3.146), but retardation effects (Appendix A) are not included. To improve
accuracy, the energy-differential cross section dsbyde is obtained by integrating df/dE
(gosfunc in the program) over a logarithmic grid, as in Eq. (3.151), with limits of
integration given by Eqs. (3.152) and (3.153). The energy-differential cross
section dsbyde is integrated to give the partial cross section sigma, making use of the
E-dependence described by Eq. (3.154) in order to achieve reasonable accuracy for
a relatively large increment einc, taken as delta/10, delta being the integration
window specified in the input. In the printed output, the integration is continued beyond
E = Ek + delta but with increasing energy steps, so that the integral cross section
σK(β) can be displayed as an asymptotic limit.

Total K-shell cross sections (as required for EDX spectroscopy, for example) can
be obtained as the value of σK(β) after entering β = π rad = 3142 mrad, delta ≈
100 eV and taking the asymptotic value of sigma corresponding to large Δ. For
E0 < 300 keV, these cross sections are within 3% of the Hartree-Slater values
quoted by Scofield (1978) for Ar and Ni.

Since the energy-differential cross section dσ/dE is independent of K-edge
threshold energy ek, its value at any scattering angle θ = β and energy loss E can
be displayed by setting delta = 0 and ek = E in the input data. Likewise, a K-loss
cross section sigma integrated between any two values of energy loss (E1 and E2)
can be displayed by entering the lower energy loss as ek and the energy difference
(E2 - E1) as the specified integration window delta.


B.14 Modified-Hydrogenic L-Shell Cross Sections 


SIGMAL3 evaluates cross sections (in barn = 10^-24 cm^2) for L-shell ionization
by fast incident electrons. It uses relativistic kinematics (without retardation) and
an expression for the generalized oscillator strength (Choi et al., 1973) based on
hydrogenic wavefunctions, with the screening constants recommended by Slater
(1930). To more correctly match observed edge shapes, the generalized oscillator
strength is multiplied by a correction factor rf, calculated for each energy loss
through the use of an empirical parameter xu. Values of xu, together with the L3 and
L1 threshold energies of each element, are tabulated at the beginning of the program.
Approximate allowance for white-line peaks, for 18 < Z < 28, is made by using the
full-hydrogenic oscillator strength (rf = 1) within 20 eV of the L3 threshold. In other
respects, the calculation follows the same procedure as SIGMAK3.

Values of xu are modified from those of SIGMAL and SIGMAL2, so that the 
program gives integrated oscillator strengths f(100 eV) equal to tabulated values 
(Egerton, 1993) based on Hartree-Slater, EELS, and photoabsorption data. The
program gives an output for the elements Al to Kr inclusive, although probably with
less accuracy toward the ends of that range. The algorithm is not designed to
provide accurate values of the differential cross section dsbyde within 50 eV of the
ionization edge or to properly simulate the Bethe ridge at high scattering angle.

As with SIGMAK3, total L-shell cross sections can be obtained by entering
β = 3142 mrad and taking the asymptotic value of sigma (corresponding to large
Δ); for E0 < 300 keV, the program yields cross sections which are within 8% of
those calculated by Scofield (1978) for Ar and Ni.


B.15 Parameterized K-, L-, M-, N-, and O-Shell Cross Sections 


The program SIGPAR2 calculates the partial cross section σk(β, Δ) of a major
ionization edge using Eq. (A.7), which embodies the relativistic oscillator strength and
kinematics for an isotropic material. It is valid only for a limited collection angle,
falling within the dipole region: β << (E/E0)^(1/2), since it uses values of optical
oscillator strength f(Δ) (together with estimates of their accuracy) that are stored
in the files FK.DAT, FL.DAT, FM23.DAT, FM45.DAT and FNO45.DAT. These values
represent best estimates (Egerton, 1993) based on Hartree-Slater calculations, x-ray
absorption data, and EELS measurements.

The integration window Δ should be within the range 30-250 eV; linear
interpolation or extrapolation is used to estimate f(Δ) for values of Δ other than those
used in the tabulations. In the case of M23 edges only Δ = 30 eV values are given,
based on EELS measurements of Wilhelm and Hofer (1992). If the semi-angle β lies
outside the dipole region (taken here to be half the Bethe ridge angle), a warning is
given to indicate that the calculated cross section will be too large. Because
retardation effects are included, the results should be valid for incident electron
energies as high as 1 MeV.


B.16 Measurement of Absolute Specimen Thickness 


The program TKKS calculates the thickness of a specimen whose low-loss spectrum
is supplied in the form of two-column (E, intensity) data. The algorithm is based
on the Kramers-Kronig sum rule, with approximate allowance for plural scattering
based on Eq. (5.9). A surface-mode correction term Δt can be invoked internally as
an option. Because Eq. (5.9) is a dipole formula, the spectral data should correspond
to a small collection angle: β < θr, where θr ≈ (E/E0)^(1/2) is a Bethe-ridge angle for
some mean energy loss E. If β > θr, with E taken as 50 eV, the program issues a
warning and replaces β with θr to avoid a misleading answer.
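
The collection-angle test amounts to a one-line comparison, sketched here in
Python. The function names are hypothetical and the sketch uses the simple
estimate (E/E0)^(1/2) for the Bethe-ridge angle, with E = 50 eV as the default
mean energy loss; it is not taken from the TKKS listing.

```python
import math

def bethe_ridge_angle_mrad(e_loss_ev, e0_kev):
    """Estimate of the Bethe-ridge angle, (E/E0)^(1/2), returned in mrad."""
    return 1e3 * math.sqrt(e_loss_ev / (e0_kev * 1e3))

def effective_beta_mrad(beta_mrad, e0_kev, e_mean_ev=50.0):
    """Limit beta to the Bethe-ridge angle; the flag reports whether it was changed."""
    theta_r = bethe_ridge_angle_mrad(e_mean_ev, e0_kev)
    if beta_mrad > theta_r:
        return theta_r, True        # True signals that a warning is appropriate
    return beta_mrad, False

print(effective_beta_mrad(30.0, 100.0))   # beta above the ridge angle at 100 keV
print(effective_beta_mrad(10.0, 100.0))   # beta within the dipole region
```

At 100 keV the estimated ridge angle for a 50 eV loss is a little over 22 mrad,
so collection semi-angles of 10 mrad or less remain inside the dipole region.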

In place of a recorded low-loss spectrum, SPECGEN.PSD can be used as input 
data. In either case, the zero-loss integral is entered as zero to indicate that the zero- 
loss peak must be identified, its integral calculated as I0 and a correction factor C
evaluated to make approximate allowance for plural scattering in the input data. 

Alternatively, SPECGEN.SSD, DRUDE.SSD, or any single-scattering spectrum 
with no zero-loss peak (e.g., a measured spectrum after FLOG deconvolution) can be 
used as input to TKKS. An appropriate value of the zero-loss integral must then be 
supplied; this value is 1 in the case of DRUDE.SSD, which represents probabilities 
relative to the zero-loss integral. The plural scattering correction factor C is set to 1, 
since plural scattering peaks are absent. 


B.17 Total Inelastic and Plasmon Mean Free Paths 


The program IMFP calculates inelastic mean free paths for a given material, 
for specified incident energy E0, probe convergence semi-angle α, and spectrum 
collection semi-angle β. Separate calculations are made, using the formulas of 
Malis et al. (1988), Jin and Li (2006), and Iakoubovskii et al. (2008a). Correction 
for incident-probe convergence is included, in the first two cases by use of the 
Scheinfein-Isaacson formula employed in CONCOR2. If the effective collection 
angle β* exceeds the Bethe-ridge angle (Em/E0)^(1/2), the program gives a warning 
that dipole conditions are violated (the formulas of Malis et al. and of Jin and Li 
include no dipole-region cutoff). 

In the case of a compound, the atomic number Zi, atomic weight Ai, and atomic 
fraction fi are entered for each component until Σi fi reaches 1. An effective atomic 
number Zeff is then evaluated using Eq. (5.4), and for Jin and Li (2006) an effective 
atomic weight using the same procedure. If the specimen is a material whose mean 
energy loss Em is known (Table 5.2), λ may be more accurately calculated using the 
program PMFP. 
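
As an illustration of the compound procedure, the following Python sketch evaluates an effective atomic number and then a mean free path from the Malis et al. parameterization. Neither Eq. (5.4) nor the Malis formula is reproduced in this appendix, so the forms used here are assumptions based on the commonly quoted versions: Zeff = Σ fi Zi^1.3 / Σ fi Zi^0.3, Em ≈ 7.6 Z^0.36 eV, and λ ≈ 106 F (E0/Em) / ln(2βE0/Em) nm with β in mrad and E0 in keV. The function names are hypothetical, and no convergence correction or dipole cutoff is applied.

```python
import math


def effective_atomic_number(components):
    """components: list of (Z_i, f_i) pairs whose atomic fractions f_i sum to 1.

    Assumed form: Zeff = sum(f_i * Z_i**1.3) / sum(f_i * Z_i**0.3).
    """
    total_fraction = sum(f for _, f in components)
    if abs(total_fraction - 1.0) > 1e-6:
        raise ValueError("atomic fractions must sum to 1")
    numerator = sum(f * z ** 1.3 for z, f in components)
    denominator = sum(f * z ** 0.3 for z, f in components)
    return numerator / denominator


def malis_mean_free_path_nm(z_eff, e0_keV, beta_mrad):
    """Assumed Malis-type parameterization of the inelastic mean free path (nm)."""
    em_eV = 7.6 * z_eff ** 0.36                      # mean energy loss, eV
    f_rel = (1.0 + e0_keV / 1022.0) / (1.0 + e0_keV / 511.0) ** 2
    return 106.0 * f_rel * (e0_keV / em_eV) / math.log(2.0 * beta_mrad * e0_keV / em_eV)


# Example: Al2O3 entered as atomic fractions 0.4 Al (Z = 13) and 0.6 O (Z = 8).
z_eff = effective_atomic_number([(13, 0.4), (8, 0.6)])
print(round(z_eff, 1), round(malis_mean_free_path_nm(z_eff, 200.0, 10.0)))
```

The weighting by Z^1.3 gives heavier atoms more influence than a simple average would, which is why Zeff for Al2O3 comes out slightly above 10 in this sketch.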

PMFP calculates an inelastic mean free path based on a free-electron zero-damping 
approximation, if given values of the incident energy E0, a mean energy 
loss Em, incident convergence semi-angle α, and collection semi-angle β. It first 
evaluates the effective collection angle β* and then estimates the dipole cutoff angle, 
the latter being taken as a Bethe-ridge angle θr = (Em/E0)^(1/2). If β* < θr, a dipole 
mean free path λ(β*) is calculated, using Eq. (5.2). Otherwise, the total-inelastic 
mean free path λ(θr) is printed. 

PMFP can also be used to calculate a plasmon mean free path, using Eq. (3.58) with 
the plasmon energy Ep replacing Em. The dipole value λp(β*) is printed if β* < θr = 
(Ep/E0)^(1/2) and otherwise the total plasmon mean free path λp(θr). 


B.18 Constrained Power-Law Background Fitting 


The program AFIT interpolates a power-law (AE^-r) background between two fitting 
regions, using linear least-squares fitting according to Eqs. (4.46) and (4.47). The 
fitted background passes through the middle of the data in both halves of the fitting 
region, making the background-subtracted intensity approximately zero at both ends 
of its range. 
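
The idea of a two-window power-law interpolation can be sketched in Python as follows. This is a simplified illustration, not the AFIT code: it uses an unweighted least-squares fit of ln J against ln E, whereas Eqs. (4.46) and (4.47) are not reproduced here and may differ in detail. The function names and window arguments are hypothetical.

```python
import math


def fit_power_law(energies, counts):
    """Unweighted least-squares fit of ln(J) = ln(A) - r*ln(E); returns (A, r)."""
    xs = [math.log(e) for e in energies]
    ys = [math.log(j) for j in counts]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return math.exp(mean_y - slope * mean_x), -slope


def interpolate_background(spectrum, window_1, window_2):
    """spectrum: list of (E, J); each window is an (E_min, E_max) tuple.

    Fits A*E**(-r) to the points inside either window and returns
    (A, r, background), where background is evaluated at every E.
    """
    def inside(e):
        return window_1[0] <= e <= window_1[1] or window_2[0] <= e <= window_2[1]

    selected = [(e, j) for e, j in spectrum if inside(e) and j > 0]
    a, r = fit_power_law([e for e, _ in selected], [j for _, j in selected])
    return a, r, [a * e ** (-r) for e, _ in spectrum]


# Example with synthetic data J = 1e9 * E**-3 between 200 and 600 eV.
synthetic = [(float(e), 1.0e9 * e ** -3.0) for e in range(200, 601, 5)]
a_fit, r_fit, _ = interpolate_background(synthetic, (200, 250), (550, 600))
print(round(r_fit, 3))
```

Because both windows constrain the fit, the interpolated curve cannot drift away from the data at the far end of the range, which is the main advantage over extrapolating from a single pre-edge window.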

The program BFIT is used for obtaining a more accurate core-loss integral 
(Egerton and Malac, 2002). Pre-edge and post-edge energy windows are chosen, 
just below and far above the threshold energy, as in AFIT. A first-estimate 
interpolated background is obtained, based on a generalization of Eq. (4.51) rather than on 
least-squares fitting. A core-loss energy window is then defined, starting just beyond 
the ELNES region (e.g., 50 eV above the ionization threshold) where the spectral 
intensity falls smoothly. Based on the integrated intensity in that window and the 
r-value of the previous fit, the core-loss integral within the post-edge window is 
calculated and subtracted from the total integral to provide an estimate of the 
background component. This estimate is used in a second interpolation, giving a revised 
value of r, and the process is repeated until the value of r converges to a limit. 


Appendix C 
Plasmon Energies and Inelastic Mean Free Paths 


The following table lists the atomic number Z, atomic weight A, and density ρ 
of some common elements and compounds, together with their chemical symbol 
and crystal structure, using the notation: a = amorphous, b = body-centered cubic, 
c = cubic, f = face-centered cubic, h = hexagonal, l = liquid, o = orthorhombic, 
r = rhombohedral, t = tetragonal. 

The measured “plasmon” energy Ep (generally the maximum of the main energy-loss 
peak below 40 eV) and its full width at half-maximum ΔEp are mostly taken 
from Daniels et al. (1970), Colliex et al. (1976a), Raether (1980), Colliex (1984), 
and Ahn (2004). The quantity λfe is a plasmon mean free path calculated from the 
free-electron formula, Eq. (3.58), assuming ΔEp = 0 but based on the tabulated 
value of Ep, with m = m0 and β = β* as specified below. 
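
The free-electron estimate can be reproduced approximately with the short Python sketch below. Eq. (3.58) is not shown in this appendix, so the form used here is an assumption: λp(β) = 2 a0 m0v² / [Ep ln(1 + β²/θE²)] with θE = Ep/(γ m0v²). The function name is hypothetical; the Bohr radius and rest energy are taken from Table E.2.

```python
import math

M0C2_KEV = 511.0   # electron rest energy in keV (Table E.2)
A0_NM = 0.05292    # Bohr radius in nm (Table E.2)


def free_electron_plasmon_mfp_nm(ep_eV, e0_keV, beta_mrad):
    """Assumed zero-damping free-electron plasmon mean free path, in nm."""
    gamma = 1.0 + e0_keV / M0C2_KEV
    v2_over_c2 = 1.0 - 1.0 / gamma ** 2
    m0v2_eV = 1000.0 * M0C2_KEV * v2_over_c2        # m0*v^2 in eV
    theta_e = ep_eV / (gamma * m0v2_eV)             # characteristic angle, rad
    beta = beta_mrad / 1000.0
    return 2.0 * A0_NM * m0v2_eV / (ep_eV * math.log(1.0 + (beta / theta_e) ** 2))


# Example: Ge with Ep = 15.8 eV at 200 keV and an effective angle of 12 mrad.
print(round(free_electron_plasmon_mfp_nm(15.8, 200.0, 12.0), 1))
```

With E0 = 200 keV and β* = 12 mrad this sketch gives about 149 nm for Ge (Ep = 15.8 eV), within about 1 nm of the 148 nm listed in the table; small differences are expected from rounding of the constants and of β*.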

The last two columns give inelastic mean free paths measured by Iakoubovskii 
et al. (2008b) for energy losses up to 150 eV, using 200-keV incident electrons, a 
probe convergence semiangle of α = 20 mrad, and collection semiangle β = 20 
mrad, giving an effective collection angle β* of typically 12 mrad, according to 
Eq. (4.73). The plasmon mean free path λp represents just the collective 
valence-electron component of inelastic scattering. Differences between λp and λi reflect 
single-electron excitation, for example, an inner-shell ionization edge occurring 
below 150 eV. 

The total inelastic mean free path λi is the appropriate quantity to use in Eq. (5.1) 
for thickness measurement. At E0 = 100 keV, values of λi are about a factor of 1.45 
lower than at 200 keV; see Section 5.1, Table 5.2. 
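
As a brief illustration of how a tabulated λi is used, the sketch below applies the log-ratio relation t = λ ln(It/I0), which is assumed here to be the content of Eq. (5.1), together with the approximate 1.45 scaling between 200 and 100 keV quoted above. The function names are hypothetical.

```python
import math


def thickness_log_ratio_nm(total_integral, zero_loss_integral, mfp_nm):
    """Log-ratio thickness t = lambda * ln(It / I0), in the units of mfp_nm."""
    return mfp_nm * math.log(total_integral / zero_loss_integral)


def mfp_at_100_kev_nm(mfp_200_kev_nm):
    """Approximate 100-keV mean free path from a 200-keV table value."""
    return mfp_200_kev_nm / 1.45


# Example: lambda_i = 120 nm (Ge at 200 keV) and It/I0 = 1.5.
print(round(thickness_log_ratio_nm(1.5, 1.0, 120.0), 1))
```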
11 


12 


14 


12.01 


22.99 


24.31 


26.98 


28.09 


30.97 
32.06 
40.08 
44.96 
47.90 
50.94 
52.00 


54.94 
55.85 


58.93 
58.70 
63.55 


65.38 
69.72 


Material 


Li (b) 
LiH 
LiF 

Be (h) 
B (a) 
BN (h) 
BN (a) 
B203 
C (c) 

C (h) 
C (a) 
Na (b) 
NaCl 
Mg (h) 
MgO 
MgF2 
Al (f) 
Al2O3 (a) 
Al2O3 (a) 
AlAs 
Si (c) 
Si (a) 
SiC (a) 
Si3N4 (α) 
SiO2 (a) 
P (o) 

S (o) 
Ca (f) 
CaO 
Sc (h) 
Sc203 
Ti (h) 
TiO 

V (b) 
V205 
Cr (b) 
Cr203 
Mn (c) 
Fe (b) 
Fe203 
Co (h) 
CoO 
Ni (f) 
NiO 
Cu (f) 
Zn (h) 
Ga 
GaP 
GaAs 
GaSb 


ρ (g/cm³) 


0.53 
0.82 
2.64 
1.85 
2.35 
2.1 
2.28 
1.81 
3.52 
2.27 
1.8-2.1 
0.97 
2.165 
1.74 
3.58 
3.17 
2.70 
3.97 
3.2 


2.33 
2.2 
3.21 
3.29 
2.65 
1.82 
2.07 
1.55 
3.3 
2.99 
3.86 
4.54 
4.93 
6.11 
3.36 
7.19 
5.21 
7.43 
7.87 
5.24 
8.9 
6.45 
8.90 
6.67 
8.96 
7.13 
5.91 
4.14 
5.31 


Ep (eV) 


7.1 
20.9 
24.6 
18.7 
22.7 
9;26 
24 


33.2 
7;27 
24 
5.7 
15.5 
10.3 
22.3 
24.6 
15.0 
26 
23 
16.1 
16.7 
16.3 
21.5 
23.7 
22.4 


8.8 
14 
17.9 
21.8 
24.9 


21.6 
23.0 
21.8 
20.9 
24.6 
20.7 
22.6 
19.3 
17.2 
13.8 
16.5 
15.8 
13.3 


ΔEp (eV) 


2.2 


4.8 
18 


20 
0.4 


21 


0.6 


λfe (nm) 


289 
118 
103 
129 
110 


106 
81 


106 
348 
151 
211 
112 
103 
155 


109 
146 
142 
145 


112 


241 
604 
134 
114 
102 


115 
109 


118 


119 
111 
126 
138 
166 
143 
148 
171 


λp (nm) 


169 
126 


116 


214 


160 


168 


160 
200 


202 
158 
149 


146 
121 


108 
103 


100 
106 


λi (nm) 


160 
123 


120 
112 


160 


150 
133 


134 
140 


145 


155 
160 
200 


130 


125 
120 
120 
109 
116 
104 
118 
106 
102 
116 

98 
115 

98 
115 
100 
106 
(continued) 
Z A Material ρ (g/cm³) Ep (eV) ΔEp (eV) λfe (nm) λp (nm) λi (nm) 
32 72.59 Ge (c) 5.32 15.8 148 126 120 
GeO2 6.24 130 
33 74.92 As (1) 5.73 18.7 129 
As (a) 1.97 16.7 142 
34 78.96 Se (h) 4.79 17.1 6.2 205 130 
Se (a) 16.3 6.2 145 
SeO2 3.95 3:95 130 
37 85.47 Rb (b) 1.63 3.41 0.6 539 
38 87.62 Sr (b) 2.54 8.0 23 261 
SrO 4.7 (32) 126 
39 88.91 Y (h) 4.47 12.5 7 354 124 
YH2 15.3 
Y2O3 5.01 5.01 122 
40 91.22 Zr (h) 6.51 268 113 
ZrO 5.6 115 
41 92.91 Nb (b) 8.57 194 105 
42 Mo (b) 10.22 25.2 163 98 
MoO3 4.69 24.4 111 
44 101.1 Ru (b) 12.37 134 90 
46 106.4 Pd (b) 12.02 118 94 
PdO 8.70 110 
47 107.9 Ag (f) 10.5 25 125 100 
Ag2O 7.14 112 
48 112.4 Cd (h) 8.65 130 107 
49 114.8 In (t) 7.31 11.4 12 129 110 
50 118.7 Sn (t) 7.31 13.7 1.3 167 273 115 
SnO2 6.95 115 
51 121.8 Sb (r) 6.68 15.2 3.3 145 234 120 
52 127.6 Te (h) 6.24 17.1 6.2 216 130 
53 126.9 I (o) 4.93 233 140 
55 132.9 Cs (b) 1.87 2.9 175 
56 137.3 Ba (b) 3.59 27.8 75 94 
BaO 5.72 (27.6) 125 
57 138.9 La (h) 6.15 
La2O3 6.51 130 
58 140.1 Ce (f) 6.77 
Ce2O3 6.86 125 
59 140.9 Pr (h) 6.77 
Pr2O3 7.07 122 
60 144.2 Nd (h) 7.01 
Nd2O3 7.24 14.2 162 120 
62 150.4 Sm (r) 7.52 280 112 
Sm2O3 8.35 135 120 
63 Eu2O3 7.42 118 
64 157.3 Gd (h) 7.9 275 110 
Gd2O3 7.41 14.6 158 125 
65 158.9 Tb (h) 8.23 13.3 
Tb2O3 7.3 125 
66 162.5 Dy (h) 8.55 310 118 
Dy2O3 7.81 126 
67 Ho2O3 8.4 120 
(continued) 
Z A Material ρ (g/cm³) Ep (eV) ΔEp (eV) λfe (nm) λp (nm) λi (nm) 
68 167.3 Er (h) 9.07 14 
Er2O3 8.64 115 
70 173.0 Yb (f) 6.90 275 110 
Yb2O3 9.17 115 
72 178.5 Hf (h) 13.3 237 95 
73 181.0 Ta (b) 16.65 183 88 
74 183.9 W (b) 19.35 151 82 
WO3 7.16 110 
75 186.2 Re (h) 21.04 28 141 78 
77 192.2 Ir (b) 22.4 121 78 
IrO2 11.67 29 110 
78 195.1 Pt (b) 21.45 22.6 111 120 82 
79 197.0 Au (f) 19.32 24.8 120 84 
80 200.6 Hg (l) 13.55 6.4 1 
HgO 11.4 116 
81 204.4 Tl (h) 11.85 135 95 
82 207.2 Pb (f) 11.35 13.0 141 99 
PbO 9.5 122 
83 209.0 Bi (r) 9.75 14.2 6.5 162 147 105 
Bi2O3 8.6 125 


Appendix D 
Inner-Shell Energies and Edge Shapes 


The following table gives threshold energies Ek (in eV) of the ionization edges 
observable by EELS, based on data of Bearden and Burr (1967), Siegbahn et al. 
(1967), Zaluzec (1981), Ahn and Krivanek (1983), and Colliex (1985). The most 
prominent edges (those most suitable for elemental analysis) are shown in italics. 
Where possible, an accompanying symbol is used to indicate the observed edge 
shape: 


h denotes a hydrogenic edge with sawtooth profile (rapid rise at the threshold 
followed by more gradual decay), as in Fig. 3.43. 

d denotes a delayed maximum due to centrifugal-barrier effects (Section 3.7.1), 
giving a rounded edge with a maximum at least 10 eV above the threshold 
energy as in Figs. 3.44 and 3.47. 

w denotes sharp white-line peaks at the edge threshold, due to excitation to 
empty d-states (in the transition metals) or f-states (in the rare earths), as in 
Fig. 3.48a. 

p denotes a low-energy edge that appears more like a plasmon peak than a 
typical edge. However, the energy given is that of the edge onset, not the 
intensity maximum. 


Because of near-edge fine structure, which depends on the chemical and 
crystallographic structure of a specimen, this classification can serve only as a rough 
guide. Elements such as copper exist in different valence states, giving rise to 
dissimilar edge shapes (Fig. 3.46). The edge energies themselves can vary by several 
electron volts, depending on the chemical environment of the excited atom; see 
Section 3.7.4. 
State → 1s 2s 2p1/2 2p3/2 3p 3d 4p 
Shell → K L1 L2 L3 M23 M45 N23 
2 He 24.6h 
3 Li 55h 
4 Be 111h 
5 B 188h 
6 C 284h 
7 N 400h 
8 O 532h 
9 F 685h 
10 Ne 867h 18w 
11 Na 1072h 32h 
12 Mg 1305h 52h 
13 Al 1560h 118h 73d 
14 Si 1839h 149h 100d 
15 P 2149h 189h 135d 
16 S 2472h 229h 165d 
17 Cl 2823 270h 200d 
18 Ar 3203 320h 246d 
19 K 3608 377h 294w 
20 Ca 4038 438h 350w 347w 
21 Sc 4493 500h 406w 402w 
22 Ti 4965 564h 461w 455w 47 
23 V 5465 628h 520w 513w 47 
24 Cr 5989 695h 584w 575w 48 
25 Mn 6539 770h 652w 640w 51 
26 Fe 7113 846h 721w 708w 57 
27 Co 7709 926h 794w 779w 62 
28 Ni 8333 1008 872w 855w 68 
29 Cu 8979 1096 951h 931h 74 
30 Zn 9659 1194 1043 1020d 87 
31 Ga 1298 1142 1115d 105 
32 Ge 1414 1248 1217d 125 30 
33 As 1527 1359 1323d 144 41 
34 Se 1654 1476 1436d 162 57h 
35 Br 1782 1596 1550d 182 70d 
36 Kr 1921 1727 1675 214 89h 
37 Rb 2065 1846w 1804w 239 111d 
38 Sr 2216 2007w 1940w 270 134d 20p 
39 Y 2373 2155w 2080w 300 160 28p 
40 Zr 2532 2307w 2222w 335 181 32p 
41 Nb 2698 2465w 2371w 371 207h 35h 
42 Mo 2866 2625w 2520w 400 228h 37d 
44 Ru 3224 2967w 2838w 472 281h 42d 
State → 3d3/2 3d5/2 4p 4d 4f 5p 5d 
Shell → M4 M5 N23 N45 N6, N7 O2, O3 O4, O5 
45 Rh 312 308d 48 
46 Pd 340 335d 50 
47 Ag 373 367d 59 
48 Cd 411 404d 67 
49 In 451 443d 77 
50 Sn 494 485d 90 
51 Sb 537 528d 99 32 
52 Te 582 572h 110 40 
53 I 631 620h 123 50 
54 Xe 685 672h 147 64 
55 Cs 740w 726w 78 
56 Ba 796w 781w 93 
57 La 849w 832w 99 
58 Ce 902w 884w 110 
59 Pr 951w 931w 114 
60 Nd 1000w 978w 118 
62 Sm 1107w 1081w 130 
63 Eu 1161w 1131w 134 
64 Gd 1218w 1186w 141 
65 Tb 1276w 1242w 148 
66 Dy 1332w 1295w 154 30,23 
67 Ho 1391w 1351w 161 31,24 
68 Er 1453w 1409w 168 31,25 
69 Tm 1515 1468w 177 32,25 
70 Yb 1576 1527w 184 33,26 
71 Lu 1640 1589w 195 35,27 
72 Hf 1716 1662h 38,30 
73 Ta 1793 1735h 45,37 
74 W 1872 1810h 37,34 47,37 
75 Re 1949 1883h 47,45 46,35 
76 Os 2031 1960h 52,50 58,46 
77 Ir 2116 2041h 63,60 63,51 
78 Pt 2202 2122h 74,70 66,51 
79 Au 2291 2206h 87,83 72,54 
80 Hg 2385 2295h 81,58 
81 Tl 2485 2390h 14p 
82 Pb 2586 2484h 21p 
83 Bi 2688 2580h 27h 
90 Th 3491 3332 88w 
92 U 3728 3552 96w 


In the preceding table, the notation L23 (for example) indicates L2 and L3 edges 
that are close in energy, such that the individual thresholds are unresolved or poorly 
resolved by electron microscope EELS systems. The following table relates the 
spectroscopic (shell) notation to the quantum numbers and degeneracy (2j + 1) of 
the initial state involved in a transition. The shells are listed in order of decreasing 
binding energy. 
Shell   State    n   l   j     Degeneracy 
K       1s1/2    1   0   1/2   2 
L1      2s1/2    2   0   1/2   2 
L2      2p1/2    2   1   1/2   2 
L3      2p3/2    2   1   3/2   4 
M1      3s1/2    3   0   1/2   2 
M2      3p1/2    3   1   1/2   2 
M3      3p3/2    3   1   3/2   4 
M4      3d3/2    3   2   3/2   4 
M5      3d5/2    3   2   5/2   6 
N1      4s1/2    4   0   1/2   2 
N2      4p1/2    4   1   1/2   2 
N3      4p3/2    4   1   3/2   4 
N4      4d3/2    4   2   3/2   4 
N5      4d5/2    4   2   5/2   6 
N6      4f5/2    4   3   5/2   6 
N7      4f7/2    4   3   7/2   8 
O2      5p1/2    5   1   1/2   2 
O3      5p3/2    5   1   3/2   4 
O4      5d3/2    5   2   3/2   4 
O5      5d5/2    5   2   5/2   6 


The following specimens provide accurate calibration of the energy-loss axis for 
a high-resolution spectrometer system (P.E. Batson, personal communication): 


aluminum (midpoint of edge onset = 72.9 eV) 
silicon (midpoint of edge onset = 99.9 eV) 
amorphous SiO2 (L23 edge maximum = 108.3 eV) 
graphite (maximum of π* peak = 285.37 eV) 
NiO (Ni L3 maximum = 852.75 eV) 
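
A two-point linear calibration based on reference features such as those listed above might be sketched as follows. The function names and the channel positions in the example are hypothetical; the sketch assumes the energy-loss axis is linear in channel number.

```python
def two_point_calibration(channel_1, energy_1_eV, channel_2, energy_2_eV):
    """Return (dispersion in eV/channel, energy offset in eV at channel 0)."""
    dispersion = (energy_2_eV - energy_1_eV) / (channel_2 - channel_1)
    offset = energy_1_eV - dispersion * channel_1
    return dispersion, offset


def channel_to_energy(channel, dispersion, offset):
    """Convert a channel number to energy loss using the linear calibration."""
    return offset + dispersion * channel


# Example: Al edge onset (72.9 eV) found at channel 145.8 and the
# NiO Ni L3 maximum (852.75 eV) found at channel 1705.5 (illustrative values).
disp, off = two_point_calibration(145.8, 72.9, 1705.5, 852.75)
print(round(disp, 4), round(off, 4))
```

Choosing two features far apart in energy, as in this example, reduces the sensitivity of the fitted dispersion to errors in locating either feature.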


Appendix E 
Electron Wavelengths, Relativistic Factors, 
and Physical Constants 


Table E.1 lists (as a function of the kinetic energy E0 of an electron) values of 
its wavelength λ, wave number k0, velocity v, relativistic factor γ, effective kinetic 
energy T, and the parameter 2γT used to calculate the characteristic scattering angle 
θE = E/(2γT). For values of E0 not tabulated, these parameters can be calculated 
from the following equations: 


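$$k_0 = \gamma m_0 v/\hbar = 2590\,(\gamma v/c)\ \mathrm{nm}^{-1}$$

$$\gamma = (1 - v^2/c^2)^{-1/2} = 1 + E_0/(m_0 c^2) = 1 + E_0/(511.00\ \mathrm{keV})$$

$$\frac{v^2}{c^2} = \frac{E_0\,(E_0 + 2 m_0 c^2)}{(E_0 + m_0 c^2)^2}$$

$$T = \frac{m_0 v^2}{2} = E_0\,\frac{1 + \gamma}{2\gamma^2} = E_0\,\frac{1 + E_0/(2 m_0 c^2)}{[1 + E_0/(m_0 c^2)]^2}$$

$$2\gamma T = \gamma m_0 v^2 = E_0 \left(\frac{E_0 + 2 m_0 c^2}{E_0 + m_0 c^2}\right)$$

$$\theta_E = \frac{E}{2\gamma T} = \frac{E}{\gamma m_0 v^2} = \frac{E}{E_0\,(1 + \gamma^{-1})} = \frac{E}{E_0}\left(\frac{E_0 + m_0 c^2}{E_0 + 2 m_0 c^2}\right)$$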


Values of the fundamental constants for use in these (and other) equations are given 
in Table E.2. 
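
For values of E0 between the tabulated entries, the relations given in this appendix (γ = 1 + E0/m0c², v²/c² = E0(E0 + 2m0c²)/(E0 + m0c²)², k0 = 2590 γv/c nm⁻¹, T = m0v²/2) can be evaluated with a short Python sketch such as the following. The function and key names are hypothetical; the rest energy of 511.00 keV and the 2590 nm⁻¹ prefactor are the values quoted in the text.

```python
import math

M0C2_KEV = 511.00  # electron rest energy in keV


def electron_parameters(e0_keV):
    """Relativistic electron parameters for kinetic energy e0_keV."""
    gamma = 1.0 + e0_keV / M0C2_KEV
    v2_over_c2 = e0_keV * (e0_keV + 2.0 * M0C2_KEV) / (e0_keV + M0C2_KEV) ** 2
    k0_per_nm = 2590.0 * gamma * math.sqrt(v2_over_c2)
    t_keV = 0.5 * M0C2_KEV * v2_over_c2          # T = m0*v^2/2
    return {
        "wavelength_pm": 1000.0 * 2.0 * math.pi / k0_per_nm,
        "k0_per_nm": k0_per_nm,
        "v2_over_c2": v2_over_c2,
        "gamma": gamma,
        "T_keV": t_keV,
        "two_gamma_T_keV": 2.0 * gamma * t_keV,
    }


def characteristic_angle_mrad(energy_loss_eV, e0_keV):
    """theta_E = E / (2*gamma*T), returned in mrad."""
    two_gamma_t_eV = 1000.0 * electron_parameters(e0_keV)["two_gamma_T_keV"]
    return 1000.0 * energy_loss_eV / two_gamma_t_eV


# Example: parameters at 100 keV, for comparison with the Table E.1 entries.
for name, value in electron_parameters(100.0).items():
    print(name, round(value, 4))
```

At E0 = 100 keV the sketch agrees with the Table E.1 row (λ = 3.70 pm, γ = 1.1957, T = 76.79 keV, 2γT = 183.6 keV) to within the rounding of the table.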
Table E.1 Electron parameters as a function of kinetic energy 

E0 (keV)   λ (pm)   k0 = 2π/λ (nm^-1)   v²/c²    γ        T = m0v²/2 (keV)   2γT (keV) 
10         12.2     514.7               0.0380   1.0196   9.714              19.81 
20         8.59     731.4               0.0739   1.0391   18.88              39.34 
30         6.98     900.2               0.1078   1.0587   27.55              58.34 
40         6.02     1044                0.1399   1.0782   35.75              77.10 
50         5.36     1173                0.1703   1.0978   43.52              95.56 
60         4.87     1291                0.1991   1.1174   50.88              113.7 
80         4.18     1504                0.2523   1.1565   64.50              149.2 
100        3.70     1697                0.3005   1.1957   76.79              183.6 
120        3.35     1876                0.3442   1.2348   87.94              217.2 
150        2.96     2125                0.4023   1.2935   102.8              266.0 
200        2.51     2505                0.4835   1.3914   123.6              343.8 
300        1.97     3191                0.6030   1.5870   154.1              489.1 
400        1.64     3822                0.6854   1.7827   175.1              624.4 
500        1.42     4421                0.7445   1.9784   190.2              752.8 
1000       0.87     7205                0.8856   2.9567   226.3              1338 


Table E.2 Selected physical constants 


Quantity                               Symbol   Value             Units 

Electron charge                        e        1.602 × 10^-19    C 
Electron rest mass                     m0       9.110 × 10^-31    kg 
Electron rest energy                   m0c²     511.00            keV 
Atomic mass unit (1/NA)                u        1.661 × 10^-27    kg 
Bohr radius (4πε0ħ²(m0e²)^-1)          a0       5.292 × 10^-11    m 
Rydberg energy (ħ²(2m0a0²)^-1)         R        13.61             eV 
Photon energy × wavelength             hc/e     1.240             eV μm 
Avogadro number                        NA       6.022 × 10^23     mol^-1 
Boltzmann constant                     k        1.381 × 10^-23    J K^-1 
Speed of light in vacuum               c        2.998 × 10^8      m s^-1 
Permittivity of space                  ε0       8.854 × 10^-12    F m^-1 
Permeability of space                  μ0       1.257 × 10^-6     H m^-1 
Planck constant                        h        6.626 × 10^-34    J s 
h/2π                                   ħ        1.055 × 10^-34    J s 


1 mmol/kg ≈ 12 ppm (atomic) for dry biological tissue (assuming mean Z ≈ 6) 
1 mmol/kg ≈ 1 mM ≈ 18 ppm (atomic) for wet biological tissue (mainly H2O) 


Appendix F 
Options for Energy-Loss Data Acquisition 


Table F.1 summarizes some of the procedural choices involved in the recording of 
energy-loss data. As discussed on p. 291, there are several ways of using the infor- 
mation contained in inelastic scattering. An energy-loss spectrum provides much 
quantitative information, such as the local thickness (p. 293), chemical composition 
(p. 269, 324), and the crystallographic and electronic structure (Section 5.6) of a 
defined region of the specimen. Energy-filtered imaging is more useful for show- 
ing variations in thickness, composition or bonding, or simply for optimizing the 
contrast arising from structural features (Section 5.3). A spectrum image (p. 103) 
combines the spatial and energy-loss information and allows sophisticated proce- 
dures such as multivariate statistical analysis (p. 265) to be applied to previously 
acquired data. Energy-filtered diffraction can be useful for the quantitative interpre- 
tation of diffraction patterns (p. 317), for examining the directionality of chemical 
bonding (Fig. 3.60) or for finding out which scattering processes contribute to the 
energy-loss spectrum of a particular specimen (Section 3.3). 

The three basic types of TEM-EELS systems were described earlier (Fig. 2.30). 
A spectrometer mounted beneath a TEM column is the most common choice for 
acquiring energy-loss spectra; the Gatan GIF system also provides energy-filtered 
images and diffraction patterns. An in-column filter has the advantage that an 
energy-filtered image can appear on the large fluorescent screen of the TEM, in 
addition to a CCD monitor. Spectroscopy and spectrum imaging are possible, but 
the latter is less dose efficient than the equivalent STEM technique (p. 106) and 
extracting a spectrum with good energy resolution may require specimen-drift cor- 
rection and interpolation (p. 104). The relative advantages of the TEM and STEM 
for acquiring energy-filtered images and spectrum image data are discussed in 
Section 2.6.5. 

High accelerating voltage maximizes the beam current available in a small probe 
and makes it easier to obtain good spatial resolution, although aberration 
correctors relax this requirement. Since high incident energy E0 is equivalent (in terms 
of the amount of scattering) to a thinner specimen, the signal/background ratio at 
ionization edges is improved, reducing the need for deconvolution and making 
quantitative analysis more feasible. Low E0 increases the intensity of inelastic scattering 
relative to the zero-loss peak, reducing the deleterious effect of its tail on low-loss 
Table F.1 Options involved in the acquisition of energy-loss data 

Parameter                 Options                    Main advantages 

Type of data              Energy-loss spectrum       Quantitative data from a defined area 
                          Energy-filtered image      Spatial distribution, at least qualitative 
                          Spectrum image             Large information content 
                          Energy-filtered DP         Reveals physical processes 

Type of instrumentation   Spectrometer below TEM     Convenient for spectroscopy 
                          In-column filter           Convenient for EFTEM 
                          Dedicated STEM             Ideal for spectrum imaging 

Incident energy           High (e.g., 200 keV)       Ionization edges more visible 
                          Low (e.g., 60 keV)         Reduced damage and Cerenkov effects 

TEM mode                  Image on TEM screen        Easy spatial location of spectrum 
                          DP on TEM screen           More precise spatial determination 

Collection angle          β < 10 mrad                Dipole conditions, high edge jump ratio 
                          β > 100 mrad               Good for E > 1 keV and log ratio method 

Energy dispersion         dE/dx > 1 eV/channel       Good for high energy losses 
                          dE/dx < 0.1 eV/channel     Improves low-loss energy resolution 

Recording time            Short                      Less drift, less radiation damage 
                          Long                       Lower shot noise, better statistics 

Number of readouts        Small                      Low readout noise 
                          Large                      Large dynamic range, drift correction 


spectroscopy (e.g., bandgap measurement, p. 368). Lower voltage also reduces any 
Cerenkov contribution below 5 eV (p. 154, 369) and reduces possible knock-on 
damage (atomic displacement or sputtering from surfaces), even if E0 exceeds the 
threshold energy. 

In the case of a below-column spectrometer, TEM image mode (p. 63) makes it 
easy to see what region of a specimen is giving rise to the energy-loss spectrum, 
simply by lowering the viewing screen. However, aberrations of the imaging lenses may 
preclude precise spatial localization (p. 64), whereas TEM diffraction mode allows 
regions of diameter down to 1 nm (or even below) to be defined by means of a very 
small probe. Alternatively, diffraction mode with a large-diameter beam provides 
high spectral intensity (useful for core-loss spectroscopy) because the spectrum 
contains contributions from the entire beam area, not limited by the spectrometer 
entrance aperture (Fig. 2.16). In diffraction mode, the center of the diffraction 
pattern must be aligned to the center of the spectrometer entrance aperture, usually 
by manual adjustment for maximum intensity (p. 64). 

The spectrum collection semi-angle β is determined by a TEM objective aperture 
or, in diffraction mode or a dedicated STEM, by a spectrometer entrance aperture. 
Small β increases the signal/background ratio at an ionization edge and allows the 
use of dipole formulas (necessary when measuring thickness using the 
Kramers-Kronig sum rule, for example; p. 302). Large β simplifies thickness measurement 
by the log ratio method (p. 301) and is useful for ionization edges above 1 keV, to 
obtain adequate intensity. 

Long recording time of the spectrum minimizes statistical (shot) noise, of prime 
importance for recording ionization edges. One limit comes from saturation of the 
electron detector, which typically limits the time to fractions of a second in the 
low-loss region, especially if the zero-loss peak is included. This limit can be extended 
by combining multiple readouts (p. 93), possibly at the expense of readout noise 
(p. 91, 95). Multiple readouts also allow correction for energy drift arising from 
change in accelerating voltage or spectrometer current, the data-acquisition 
computer being programmed to recognize and align some prominent spectral feature. 
Another time limit comes from specimen drift, which sometimes can also be 
compensated electronically. A more fundamental limit arises from radiation damage, 
whose severity depends very much on the type of specimen (p. 389). Damage in 
conducting samples (e.g., metals) arises from knock-on processes and can be reduced 
by lowering the incident energy, ideally below some damage threshold (p. 122, 396). 
Damage in organic and inorganic compounds is usually due to radiolysis, and can 
be reduced by a modest factor (e.g., 3) by cooling the specimen. 
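
The energy-drift correction mentioned above (aligning multiple readouts on a prominent spectral feature before summing) can be illustrated with a minimal Python sketch. This is not the procedure of any particular acquisition software: the function name is hypothetical, the feature is simply taken to be the channel of maximum intensity in each readout, and shifts are restricted to whole channels.

```python
def align_and_sum(readouts):
    """Sum equal-length spectra after shifting each so its peak channel
    coincides with the peak channel of the first readout."""
    n_channels = len(readouts[0])
    reference = readouts[0]
    ref_peak = max(range(n_channels), key=lambda ch: reference[ch])
    total = [0.0] * n_channels
    for spectrum in readouts:
        peak = max(range(n_channels), key=lambda ch: spectrum[ch])
        shift = peak - ref_peak          # drift of this readout, in channels
        for ch in range(n_channels):
            source = ch + shift
            if 0 <= source < n_channels:  # channels shifted off the end are dropped
                total[ch] += spectrum[source]
    return total


# Example: three readouts of the same peak, drifted by 0, +1 and -1 channels.
readouts = [
    [0, 1, 10, 1, 0, 0],
    [0, 0, 1, 10, 1, 0],
    [1, 10, 1, 0, 0, 0],
]
print(align_and_sum(readouts))
```

Summing without alignment would smear the peak over three channels; after alignment the summed peak keeps its single-readout width while the counts add.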